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(54) GL50 molecules and uses therefor 

(57) The invention provides isolated nucleic acids 
molecules, designated GL50 nucleic acid molecules, 
which encode GL50 polypeptides. The invention also 
provides antisense nucleic acid molecules, recombinant 
expression vectors containing GL50 nucleic acid mole- 
cules, host cells into which the expression vectors have 
been introduced, and nonhuman transgenic animals in 



which a GL50 gene has been introduced or disrupted. 
The invention still further provides isolated GL50 
polpypeptides, fusion proteins, antigenic peptides and 
anti-GL50 antibodies. Diagnostic, screening, and thera- 
peutic methods utilizing compositions of the invention are 
also provided. 
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Description 

Background of the Invention 

5 [0001] In order for T cells to respond to foreign proteins, two signals must be provided by antigen-presenting cells 
(APCs) to resting T lymphocytes (Jenkins, M. and Schwartz, R. (1987) J. Exp. Med. 165:302-319; Mueller, D. L. et al. 

(1990) J. Immunol. 144:3701-3709). The first signal, which confers specificity to the immune response, is transduced 
via the T cell receptor (TCR) following recognition of foreign antigenic peptide presented in the context of the major 
histocompatibility complex (MHC). The second signal, termed costimulation, induces T cells to proliferate and become 

10 functional (Lenschow et al. (1996) Annu. Rev. Immunol. 14:233). Costimulation is neither antigen-specific, nor MHC 
restricted and is thought to be provided by one or more distinct cell surface molecules expressed by APCs (Jenkins, M. 
K. et al. (1988) J. Immunol. 140:3324-3330; Linsley, P. S. et al. (1991) J. Exp. Med. 173:721-730; Gimmi, C. D. et al. 

(1991) Proc. Natl. Acad. Sci. USA 88:6575-6579; Young, J. W. et al. (1992) J. Clin. Invest 90:229-237; Koulova, L. et 
al. (1 991 ) J. Exp. Med. 1 73:759-762; Reiser, H. et al. (1 992) Proc. Natl. Acad. Sci. USA 89:271 -275; van-Seventer, G. 

15 A. etal. (1990) J. Immunol. 144:4579-4586; LaSalle, J. M.etal. (1991) J. Immunol. 147:774-80; Dustin, M. I. etal. (1989) 
J. Exp. Med. 1 69:503; Armitage, R. J. et al. (1 992) Nature 357:80-82; Liu, Y. et al. (1 992) J. Exp. Med. 1 75:437-445). 
[0002] The CD80 (B7- 1 ) and CD86 (B7-2) proteins, expressed on APCs, are critical costimulatory molecules (Freeman 
et al. (1991) J. Exp. Med. 174:625; Freeman et al. (1989) J. Immunol. 143:2714; Azuma et al. (1993) Nature 366:76; 
Freeman et al. (1993) Science 262:909). B7-2 appears to play a predominant role during primary immune responses, 

20 while B7-1 , which is upregulated later in the course of an immune response, may be important in prolonging primary T 
cell responses or costimulating secondary T cell responses (Bluestone (1995) Immunity 2:555). 

[0003] One ligand to which B7-1 and B7-2 bind, CD28, is constitutively expressed on resting T cells and increases in 
expression after activation. After signaling through the T cell receptor, ligation of CD28 and transduction of a costimulatory 
signal induces T cells to proliferate and secrete IL-2 (Linsley, P. S. et al. (1 991 ) J. Exp. Med 1 73:721 -730; Gimmi, C. D. 

25 et al. (1 991 ) Proc. Natl. Acad. Sci. USA 88:6575-6579; June, C. H. et al. (1 990) Immunol. Today 11:211 -6; Harding, F. 
A. et al. (1992) Nature 356:607-609). A second ligand, termed CTLA4 (CD152) is homologous to CD28 but is not 
expressed on resting T cells and appears following T cell activation (Brunet, J. F. et al. (1987) Nature 328:267-270). 
CTLA4 appears to be critical in negative regulation of T cell responses (Waterhouse et al. (1995) Science 270:985). 
Blockade of CTLA4 has been found to remove inhibitory signals, while aggregation of GTLA4 has been found to provide 

30 inhibitory signals that downregulate T cell responses (Allison and Krummel (1 995) Science 270:932). The B7 molecules 
have a higher affinity for CTLA4 than for CD28 (Linsley, P. S. et al. (1991) J. Exp. Med. 174:561-569) and B7-1 and 
B7-2 have been found to bind to distinct regions of the CTLA4 molecule and have different kinetics of binding to CTLA4 
(Linsley et al. (1 994) Immunity 1 :793). 

[0004] I n the past, reports of the existence of additional members of the B7 costimulatory family have been controversial. 
35 The antibody BB-1 , appeared to recognize a subset of cells greater than either B7-1 or B7-2 positive cells, arguing for 

the existence of another B7-family member, B7-3. The identity of B7-3 had been in part thought to be answered by 

expression cloning of T-cell receptor invariant chain using the BB-1 antibody. Although invariant chain is not related to 

the B7 family, this molecule facilitated a low degree of costimulation when assessed by T cell proliferation assays. 

[0005] Very recently, a novel surface receptor termed ICOS was described which had sequence identity with CD28 
*o (24%) and CTLA4 (17%) (Hutloff et al. (1999) Nature 397:263; WO 98/38216). Unlike CD28, ICOS was shown to be 

upregulated on stimulated T cells and caused the secretion of a panel of cytokines distinct from those mediated by CD28 

costimulation (Hutloff et al. (1999) Nature 397:263). 

[0006] The importance of the B7:CD28/CTLA4 costimulatory pathway has been demonstrated in vitro and in several 
in vivo model systems. Blockade of this costimulatory pathway results in the development of antigen specific tolerance 

45 in murine and human systems (Harding, F. A. et al. (1992) Nature 356:607-609; Lenschow, D. J. et al. (1992) Science 
257:789-792; Turka, L. A. et al. (1992) Proc. Natl. Acad. Sci. USA 89:11102-11105; Gimmi, C. D. et al. (1993) Proc. 
Natl. Acad. Sci. USA 90:6586-6590; Boussiotis, V. et al. (1993) J. Exp. Med. 178:1753-1763). Conversely, expression 
of B7 by B7 negative murine tumor cells induces T-cell mediated specific immunity accompanied by tumor rejection and 
long lasting protection to tumor challenge (Chen, L. et al. (1 992) Cell 71 :1 093-1 1 02; Townsend, S. E. and Allison, J. P. 

50 (1 993) Science 259:368-370; Baskar, S. et al. (1 993) Proc. Natl. Acad. Sci. USA 90:5687-5690.). Therefore, manipulation 
of the costimulatory pathways offers great potential to stimulate or suppress immune responses in humans. 

Summary of the Invention 

55 [0007] The present invention is based, at least in part, on the discovery of novel nucleic acid molecules and polypeptides 
encoded by such nucleic acid molecules, referred to herein as GL50 molecules. Preferred GL50 molecules include 
antigens on the surface of professional antigen presenting cells (e.g., B lymphocytes, monocytes, dendritic cells, Lang- 
erhan cells) and other antigen presenting cells (e.g., keratinocytes, endothelial cells, astrocytes, fibroblasts, oligodendro- 
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cytes), which costimulate T cell proliferation, bind to costimulatory receptors ligands on T cells (e.g., CD28, CTLA4, 
and/or ICOS) and/or are bound by antibodies which recognize B7 family members, e.g., anti-GL50 antibodies. 
[0008] The GL50 nucleic acid and polypeptide molecules of the present invention are useful, e.g., in modulating the 
immune response. Accordingly, in one aspect, this invention provides isolated nucleic acid molecules encoding GL50 
5 polypeptides, as well as nucleic acid fragments suitable as primers or hybridization probes for the detection of GL50- 
encoding nucleic acids. 

[0009] In one embodiment, a GL50 nucleic acid molecule of the invention is at least 50%, 55%, 60%, 65%, 70%, 75%, 
80%, 85%, 90%, 95%, 98% or more identical to a nucleotide sequence (e.g., to the entire length of the nucleotide 
sequence) including SEQ ID NO:1 , 3, or 5, or a complement thereof. 
10 [0010] In a preferred embodiment, the isolated nucleic acid molecule includes the nucleotide sequence shown SEQ 
ID NO:1, 3, or 5, or a complement thereof. In another preferred embodiment, an isolated nucleic acid molecule of the 
invention encodes the amino acid sequence of a GL50 polypeptide. 

[001 1] Another embodiment of the invention features nucleic acid molecules, preferably the GL50 nucleic acid mole- 
cules, which specifically detect the GL50 nucleic acid molecules relative to nucleic acid molecules encoding non-GL50 
15 polypeptides. For example, in one embodiment, such a nucleic acid molecule is at least 20, 30, 40, 50, 100, 150, 200, 
250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, or 800 nucleotides in length and hybridizes under stringent 
conditions to a nucleic acid molecule comprising the nucleotide sequence shown in SEQ ID NO:1 , 3, or 5, or a complement 
thereof. 

[0012] In other preferred embodiments, nucleic acid molecules of the invention encode naturally occurring allelic 
20 variants of a human GL50 polypeptide, wherein the nucleic acid molecules hybridize to a nucleic acid molecule which 
includes SEQ ID NO:1 , 3, or 5 under stringent conditions. 

[0013] Another embodiment of the invention provides an isolated nucleic acid molecule which is antisense to a GL50 
nucleic acid molecule, e.g., the coding strand of a GL50 nucleic acid molecule. 

[0014] Another aspect of the invention provides a vector comprising a GL50 nucleic acid molecule. In certain embod- 
25 iments, the vector is a recombinant expression vector. In another embodiment, the invention provides a host cell containing 

a vectorof the invention. The invention also provides a method for producing apolypeptide, preferably a GL50 polypeptide, 

by culturing in a suitable medium, a host cell, e.g., a mammalian host cell such as a non-human mammalian cell; of the 

invention containing a recombinant expression vector, such that the polypeptide is produced. 

[0015] Another aspect of this invention features isolated or recombinant GL50 polypeptides and proteins. 
30 [0016] In one embodiment, the isolated polypeptide is a human GL50 polypeptide. 

[0017] In yet another embodiment, the isolated GL50 polypeptide is a soluble GL50 polypeptide. 

[0018] In a further embodiment, the isolated GL50 polypeptide is expressed on the surface of a cell, e.g., has a 

transmembrane domain. 

[0019] In a further embodiment, the isolated GL50 polypeptide plays a role in costimulating the cytokine secretion 
35 and/or proliferation of activated T cells. In another embodiment, the isolated GL50 polypeptide is encoded by a nucleic 
acid molecule having a nucleotide sequence which hybridizes under stringent hybridization conditions to a nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NO:1 , 3, or 5. 

[0020] Another embodiment of the invention features an isolated polypeptide, preferably a GL50 polypeptide, which 
is encoded by a nucleic acid molecule having a nucleotide sequence at least about 50%, 55%, 60%, 65%, 70%, 75%, 
^o 80%, 85%, 90%, 95%, 98% or more identity to a nucleotide sequence (e.g., to the entire length of the nucleotide sequence) 
including SEQ ID NO:1 ,3, or 5 or a complement thereof. 

[0021] Another embodiment of the invention features an isolated polypeptide, preferably a GL50 polypeptide, which 
is encoded by a nucleic acid molecule having a nucleotide sequence at least about 50%, 55%, 60%, 65%, 70%, 75%, 
80%, 85%, 90%, 95%, 98% or more identity to an amino acid sequence (e.g., to the entire length of the amino acid 
45 sequence) including SEQ ID NO:2, 4, or 6. 

[0022] This invention further features an isolated GL50 polypeptide which is encoded by a nucleic acid molecule having 
a nucleotide sequence which hybridizes under stringent hybridization conditions to a nucleic acid molecule comprising 
the nucleotide sequence of SEQ ID NO:1 , 3, or 5, or a complement thereof. 

[0023] The polypeptides of the present invention can be operatively linked to a non-GL50 polypeptide (e.g., heterol- 
50 ogous amino acid sequences) to form fusion proteins. The invention further features antibodies, such as monoclonal or 
polyclonal antibodies, that specifically bind polypeptides of the invention, preferably GL50 polypeptides. In addition, the 
GL50 polypeptides, e.g., biologically active polypeptides, can be incorporated into pharmaceutical compositions, which 
optionally include pharmaceutically acceptable carriers. 

[0024] In another aspect, the present invention provides a method for detecting the presence of a GL50 nucleic acid 
55 molecule or polypeptide in a biological sample by contacting the biological sample with an agent capable of detecting 
a GL50 nucleic acid molecule or polypeptide such that the presence of a GL50 nucleic acid molecule or polypeptide is 
detected in the biological sample. 

[0025] In another aspect, the present invention provides a method for detecting the presence of GL50 activity in a 
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biological sample by contacting the biological sample with an agent capable of detecting an indicator of GL50 polypeptide 
activity such that the presence of the GL50 polypeptide activity is detected in the biological sample. 
[0026] In another aspect, the invention provides a method for modulating GL50 polypeptide activity comprising con- 
tacting a cell capable of expressing GL50 polypeptide with an agent that modulates GL50 activity such that the GL50 

5 activity in the cell is modulated. In one embodiment, the agent inhibits GL50 activity. In another embodiment, the agent 
stimulates GL50 activity. In one embodiment, the agent is an antibody that binds, preferably specifically, to a GL50 
polypeptide. In another embodiment, the agent modulates expression of GL50 by modulating transcription of a GL50 
gene or translation of a GL50 mRNA. In yet another embodiment, the agent is a nucleic acid molecule having a nucleotide 
sequence that is antisense to the coding strand of a GL50 mRNA or a GL50 gene. 

10 [0027] In one embodiment, the methods of the present invention are used to treat a subject having a disorder (char- 
acterized by aberrant GL50 polypeptide or nucleic acid expression or activity) or a condition that would benefit from 
modulation, either up or downmodulation, of a GL50 molecule by administering an agent which is a GL50 modulator to 
the subject. In one embodiment, the GL50 modulator is a GL50 polypeptide. In another embodiment the GL50 modulator 
is a GL50 nucleic acid molecule. In another embodiment a GL50 modulator molecule that modulates the interaction 

15 between GL50 and a ligand of GL50 or a molecule that interacts with the intracellular domain of GL50. In yet another 
embodiment, the GL50 modulator is a peptide, peptidomimetic, or other small molecule. In a preferred embodiment, the 
disorder characterized by aberrant GL50 polypeptide or nucleic acid expression is an immune system disorder or condition 
that would benefit from modulation of a GL50 activity. 

[0028] The present invention also provides a diagnostic assay for identifying the presence or absence of a genetic 
20 alteration characterized by at least one of (i) aberrant modification or mutation of a gene encoding a GL50 polypeptide; 
(ii) mis-regulation of the gene; and (iii) aberrant post-translational modification of a GL50 polypeptide, wherein a wild- 
type form of the gene encodes a polypeptide with a GL50 activity. 

[0029] In another aspect the invention provides a method for identifying a compound that binds to or modulates the 
activity of a GL50 polypeptide. The method includes providing an indicator composition comprising a GL50 polypeptide 
25 having GL50 activity, contacting the indicator composition with a test compound, and determining the effect of the test 
compound on GL50 activity in the indicator composition to identify a compound that modulates the activity of a GL50 
polypeptide. 

[0030] In another aspect, the invention pertains to nonhuman transgenic animal that contains cells carrying atransgene 
encoding a GL50 member polypeptide. 

30 [0031] In one embodiment, the present invention provides methods for treating cancer involving administering to a 
subject suffering from a tumor comprising administering a stimultory form of a GL50 molecule. In a preferred embodiment, 
the stimulatory form of a GL50 molecule is a soluble form of GL50 and includes the extracellular domain of a costimulatory 
molecule. In one embodiment, the costimulatory molecule is monospecific. In one embodiment, the costimulatory mol- 
ecule is dimeric. In one embodiment, the costimulatory molecule is bivalent. 

35 [0032] In another preferred embodiment, the costimulatory molecule is fused to a second protein or polypeptide which 
includes a portion of an immunoglobulin molecule (e.g., a portion of an immunoglobulin molecule that includes cysteine 
residues; a portion of an immunoglobulin molecule that includes the hinge, CH2, and CH3 regions of a human immu- 
noglobulin molecule; or a portion of an immunoglobulin molecule that includes the hinge, CH1 , CH2, and CH3 regions 
of a human immunoglobulin molecule). In yet another embodiment, the portion of the immunoglobulin molecule has 

40 been modified to reduce complement fixation and/or Fc receptor binding. 

[0033] In yet another aspect, the invention pertains to a method for reducing the proliferation of a tumor cell comprising 
contacting an immune cell with an activating form of a GL50 molecule such that an immune response to the tumor cell 
is enhanced and proliferation of the tumor cell is reduced. 

[0034] In one embodiment, the activating form of a GL50 molecule is a soluble polypeptide comprising the extracellular 
45 domain of GL50. 

[0035] In another embodiment, the activating form of a GL50 molecule is a cell associated polypeptide comprising the 
extracellular domain of GL50. 

[0036] In yet another embodiment, the invention pertains to a method for screening for a compound which modulates 
GL50 mediated activation of an immune cell comprising: i) contacting a polypeptide comprising at least one GL50 
50 polypeptide domain with a test compound and a GL50 binding partner and ii) identifying compounds that modulate the 
interaction of the polypeptide with the GL50 binding partner to thereby identify compounds that modulate GL50 mediated 
activation of an immune cell. 

[0037] In one embodiment, the polypeptide comprises a GL50 domain selected from the group consisting of: a trans- 
membrane domain, a cytoplasmic domain, and an extracellular domain. 
55 [0038] In one embodiment, the domain is a splice variant of a GL50 cytoplasmic domain. 

[0039] In one embodiment, the GL50 polypeptide domain comprises at least one amino acid substitution. 
[0040] In one aspect, the invention pertains to a method for screening for a compound which modulates signal trans- 
duction in an immune cell comprising contacting an immune cell that expresses a GL50 molecule with a test compound 
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and determining the ability of the test compound to modulate signal transduction via GL50 to thereby identify a compound 
with modulates a signal in an immune cell. 

Brief Description of the Drawings 

[0041] 

Figure 1 shows the complete nucleotide sequence of murine GL50-1 (mGL50-1), based on signal sequence clone 
(position 1-519) and RecA isolated clone (position 374-2718). Predicted nucleotides encoding a signal sequence 
are boxed and the hydrophobic transmembrane domain is underlined. 
Figure 2 shows the nucleotide sequence of murine GL50-2 (mGL50-2) product. 

Figure 3 shows a sequence alignment of mGL50-1 and mGL50-2 product. Sequence divergence occurs at nucleotide 
1 027 for mGL50-1 and at 960 for mGL50-2. 

Figure 4 shows isoform specific RT-PCR of mGL50-1 and mGL50-2. 

Figure 5 shows isoform specific Northern Blot analysis af mGL50-1 and mGL50-2. 

Figure 6 shows the nucleotide sequence of AB014553 RACE product. The boxed region is an area of divergence 
between the published AB014553 cDNA sequence and the RACE product. Final nested RACE primer extends from 
position 1 to 22, corresponding to nucleotides 655 to 676. 

Figure 7 shows an alignment of the translated RACE product and the published AB01 4553 cDNA. Divergence occurs 
at residues 299 of the published AB014553 cDNA and residues 123 of the RACE product. 
Figure 8 shows the sequence of human GL50 (hGL50). 

Figure 9 shows hydropathy plot analysis of GL50, merged AB014553 RACE product (hGL50), and mouse and 
human B7-1 and B7-2. Significant hydropathy profiles are seen between GL50 and AB014553. 
Figure 1 0 shows RT-PCR Southern blot analysis of the published AB01 4553 cDNA and AB01 4553 RACE products. 
Figure 11 shows northern analysis of multiple human tissue RNA blots. The coding sequences of the 
hGL50/AB014553 were used as probes. 

Figure 12 shows a pileup analysis of hGL50, mGL50-1, hB7-1, mB7-2, hB7-2, mB7-2 in which the signal peptide, 
Ig-like domains, transmembrane, and cytoplasmic domains are indicated. The Predicted hydrophobic transmem- 
brane residues are underlined and asterisks denote residues which contribute to Ig structure. The extracellular 
cysteines and tryptophans, indicators of Ig structure, are shown in bold. 

Figure 13 shows dendrogram analysis representing genetic distances between B7-1, B7-2 and GL50 proteins. 
Y08823 is the chicken CD80-like protein and MM867065_1 is the mouse butyrophilin. 

Figure 14 shows results of a GL50 COS transfection study. mGL50-1 was expressed in COS cells followed by 
staining with either ICOS-lg, CD28-lg, CTLA4-lg. Binding of ICOS Ig by cells expressing mGL50-1 was detected. 
Figure 15 depicts a schematic diagram of mGL50-1 and mGL50-2. Sequence divergence, indicated by vertical line, 
occurs at nucleotide 1027 for mGL50-1 and 960 for mGL50-2. The repetitive sequence (hatched box) is found in 
the 3' UTR of mGL50-2 encompassing nucleotides 1 349-1 554. Dashes and arrowheads represent oligonucleotides 
used in RT-PCR analysis. Horizontal lines represent probes used in Northern blot analysis. 

Figure 16 depicts a protein sequence alignment between mGL50-1, mGL50-2, hGL50, and Y08823. Sequences 
were aligned with PileUp, and shared residues between these molecules are boxed. Letters above sequences 
denote secondary peptide structures as predicted for Y08823 based on the crystal structure of B7-1. The exon 
encoding hGL50 cytoplasmic domain 1 sequences is indicated by bar labeled Cy-1. 

Figure 17 depicts flow cytometric analysis of ICOS binding to mouse, human, and chicken GL50-related proteins. 
COS cells transfected with expression plasmids encoding mGL50-1 , mGL50-2, hGL50, and the chicken B7-like 
protein Y08823 were incubated with mlCOS-mlgG2am, hlCOS-mlgG2am or mCTLA4-mlgG2am, followed by sec- 
ondary staining with anti-mouse lgG2a biotin and detection with streptavidin-PE. 

Figure 18 depicts ICOS binding to WEHI 231. Titered amounts of mlCOS-mlgG2am or mCTLA4-mlgG2am were 
used to stain WEHI 231 cells in the presence of blocking anti B7-1 and B7-2 antibodies or isotype controls. 
Figure 19 depicts ICOS binding to undifferentiated ES cells. Analysis of undifferentiated ES cells counter stained 
with anti-B7-1 and mlCOS-mlgG2am reagents resulted in the positive staining for both B7-1 and ICOS-ligand. 
Figure 20 depicts immunophenotyping of Balb/c and RAG1 -/- splenocyte subsets. Two dimensional plots of 1 0,000 
stained cells are presented; samples with 50,000 data points are indicated by asterisks. (A) Enriched splenocytes 
from Balb/C or RAG1 -/- mice were stained with mlCOS-mlgG2am and FITC-conjugated antibodies against CD3, 
CD24, CD45R/B220, pan NK, MHC class II, or CD40. To further phenotype the CD4+, ICOS-ligand+ cells, RAG1 
-/- cells were stained with PE-labeled anti-CD4 and FITC-labeled anti-CD1 1 c. (B) Enriched splenocytes from RAG1 
-/- and Balb/C mice (untreated, ConA activated, or LPS activated) were stained with mlCOS-mlgG2am and antibodies 
to CD4, CD8, CD19, CD11b, CD11c and CD69. 

Figure 21 depicts a phylogenetic representation ofGL50/B7 ligands and CD28/CTLA4/ICOS receptors. Distance 
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proportional phylograms were generated using values from Tables 5 (GL50/B7 ligands) and 6 (CD28/CTLA4/ICOS). 
Bars represent genetic distance expressed as substitutions per 1 00 amino acids. (A) Phylogram of GL50/B7 related 
proteins. Accession No. MMU67065_1 represents mouse butyrophilin. (B) Phylogram of ICOS/CD28/CTLA4 pro- 
teins. 

5 Figure 22 depicts proliferation and cytokine induction by GL50 costimulation of T cells, in the absence or presence 

of anti-CD28 blocking antibodies. Note: hGLSO.Fc is the same as hGL50-lgG2am. 

Figure 23 depicts T cell proliferation induced by GL50 costimulation in the presence of varied concentrations of anti- 
CD28 blocking antibodies and anti-CD3 stimulation. 

Figure 24 depicts cytokine induction by GL50 costimulation in T cells in the absence or presence of CD28 stimulation. 

10 Figure 25 depicts the ability of GL50-lgG2a to inhibit tumor growth in mice. 

Figure 26 depicts the sequence of the hlCOS-mlgG2am fusion protein. (A) The nucleotide sequence encoding 
hlCOS-mlgG2am (set forth as SEQ ID NO:23). The oncostatin-M leader sequence is encoded by the underlined 
nucleotides. Boxed nucleotides encode the mouse lgG2am domain of the fusion protein. The translation initiation 
site is indicated by an X. Introns and untranslated regions are indicated by a dashed line. The stop codon is indicated 

15 by a double underline. (B) The predicted amino acid sequence (set forth as SEQ ID NO:24) of the hlCOS-mlgG2am 

fusion protein. 

Figure 27 depicts the sequence of the mlCOS-mlgG2am fusion protein. (A) The nucleotide sequence encoding 
mlCOS-mlgG2am (set forth as SEQ ID NO:25). The oncostatin-M leader sequence is encoded by the underlined 
nucleotides. Boxed nucleotides encode the mouse lgG2am domain of the fusion protein. The translation initiation 
20 site is indicated by an X. Introns and untranslated regions are indicated by a dashed line. The stop codon is indicated 

by a double underline. (B) The predicted amino acid sequence (set forth as SEQ ID NO:26) of the mlCOS-mlgG2am 
fusion protein. 

Figure 28 depicts the sequence of the hGL50-mlgG2am fusion protein. (A) The nucleotide sequence encoding 
hGL50-mlgG2am (set forth as SEQ ID NO:27). The oncostatin-M leader sequence is encoded by the underlined 
25 nucleotides. Boxed nucleotides encode the mouse lgG2am domain of the fusion protein. The translation initiation 

site is indicated by an X. Introns and untranslated regions are indicated by a dashed line. The stop codon is indicated 
by a double underline. (B) The predicted amino acid sequence (set forth as SEQ ID NO:28) of the hGL50-mlgG2am 
fusion protein. 

Figure 29 depicts the sequence of the mGL50-mlgG2am fusion protein. (A) The nucleotide sequence encoding 
30 mGL50-mlgG2am (set forth as SEQ ID NO:29). The oncostatin-M leader sequence is encoded by the underlined 

nucleotides. Boxed nucleotides encode the mouse lgG2am domain of the fusion protein. The translation initiation 
site is indicated by an X. Introns and untranslated regions are indicated by a dashed line. The stop codon is indicated 
by a double underline. (B) The predicted amino acid sequence (set forth as SEQ ID NO:30) of the mGL50-mlgG2am 
fusion protein. 

35 Figure 30 depicts ICOS-lg staining of various splenic cell types. 

Figure 31 depicts the reduction of tumorigenicity of tumor cells transfected with GL50. 

Detailed Description of the Invention 

40 [0042] In addition to the previously characterized B lymphocyte activation antigens, e.g., B7-1 and B7-2, there are 
other antigens on the surface of antigen presenting cells (e.g., B cells, monocytes, dendritic cells, Langerhan cells, 
keratinocytes, endothelial cells, astrocytes, fibroblasts, oligodendrocytes) which costimulate T cells. 
[0043] The present invention is based, at least in part, on the discovery of novel molecules, referred to herein as GL50 
polypeptides. Murine GL50-1 (mGL50-1) was isolated from an IL-12 activated mouse lymph node library. The nucleotide 

45 sequence of mGL50-1 is shown in SEQ ID NO:1. The derived polypeptide sequence of full length mouse mGL50-1 is 
shown in SEQ ID NO:2. The sequence shares approximately 20% sequence identity with mouse B7-1 and mouse B7-2. 
mGL50-1 encodes a 322 amino acid polypeptide containing a leader sequence, extracellular Ig-like domains, a hydro- 
phobic transmembrane domain, and an intracellular domain comprising one tyrosine residue. 

[0044] 3' RACE PCR with mouse peripheral blood lymphocyte (PBL) RNA revealed an alternatively spliced form of 
50 mouse GL50 (mGL50-2). The nucleotide sequence of murine GL50-2 (mGL50-2) is shown in S EQ I D NO:3. The nucleotide 
sequence encoded a polypeptide having a divergent 27 amino acid intracellular domain, which included an additional 
three tyrosines, a 3' untranslated region with consensus polyadenylation signal, and a poly A tail which are shown in 
SEQ ID NO:4. Transcripts of both mGL50-1 and mGL50-2 were found by RT-PCR and Northern blot analysis and were 
predominantly localized in lymphoid organs of multiple tissue panels. The murine GL50 sequences identified were found 
55 to be related to a previously reported human brain cDNA clone, GenBank Accession Number AB014553. 

[0045] 3' RACE of human PBL cDNA was performed to identify human clones related to murine GL50. Clones encoding 
alternative 3' sequences were identified. The nucleotide sequence of the resulting human GL50 (hGL50 [AB014553- 
RACE]) clone is shown in SEQ ID NO:5. The nucleotide sequence encodes a 309 amino acid protein sharing about 
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26% amino acid sequence identity with the mGL50-1 , 28 % identity with mGL50-2, and amino acid sequence, approx- 
imately 13% amino acid sequence identity with human B7-1, and about 13% amino acid sequence identity with human 
and mouse B7-2. 

[0046] Flow cytometric assays using murine GL50-1lg fusion protein as a reagent demonstrated binding to COS 
transfectants expressing mouse ICOS, but not to cells expressing CD28 or CTLA-4. These results confirm that GL50 
molecules are novel members of the B7 family of molecules. 

GL50 Nucleic Acid and Polypeptide Molecules 

[0047] In one embodiment, the isolated nucleic acid molecules of the present invention encode eukaryotic GL50 
polypeptides. 

[0048] The GL50 family of molecules share a number of conserved regions, including signal domains, IgV domains 
and the IgC domains. For example, in the case of mGL50-1 (SEQ ID NO:1), the consensus 2718 nucleotide mGL50-1 
sequence encodes a 322 amino acid protein with a predicted mass of 36 kDa. Hydropathy plot of the open reading frame 
predicted a structure corresponding to a leader sequence (encoded by about nucleotides 67 to 195), an extracellular 
domain (encoded by about nucleotides 1 96 to 904), a hydrophobic transmembrane region (encoded by about nucleotides 
905 to 961 ) and a potential intracellular cytoplasmic domain (encoded by about nucleotides 962 to 1 032). Signal peptide 
cleavage was predicted at position 46 in the amino acid sequence. In one embodiment, the extracellular domain of a 
GL50 polypeptide comprises the IgV and IgC domains after cleavage of the signal sequence, but not the transmembrane 
and cytoplasmic domains of a GL50 polypeptide (e.g., corresponding to the amino acid sequence from about amino 
acid 47-277 of GL50-1 or the amino acid sequence from about amino acid 22 to about amino acid 278 of hGL50 as set 
forth in Figure 1 6). 

[0049] Analysis of the mGL50-1 amino acid sequence suggested structural similarity to an Ig-domain in the cytoplasmic 
domain of the protein. In keeping with an Ig-like structure, 4 cysteines were found in the extracellular domain, allowing 
for the possibility of intramolecular bonding and distinct structural conformation corresponding to an IgV-like domain and 
an IgC-like domain. These regions are both Ig superfamily member domains and are art recognized. These domains 
correspond to structural units that have distinct folding patterns known as Ig folds. Ig folds are comprised of a sandwich 
of two p sheets, each consisting of antiparallel p strands of 5-1 0 amino acids with a conserved disulfide bond between 
the two sheets in most, but not all, domains. IgC domains of Ig, TCR, and MHC molecules share the same types of 
sequence patterns and are referred to as C1 -set within the Ig superfamily. Other IgC domains fall within other sets. IgV 
domains also share sequence patterns and are called V set domains. IgV domains are longer than C-domains and form 
an additional pair of p strands. 

[0050] An alignment of the mGL50-2, mGL50-1, hGL50, and chicken Y08823 molecule are presented in Figure 16. 
Each of the molecules comprises a signal peptide, an IgV-like domaine, an IgC-like domain, a transmembrane domaine 
and a cytoplasmic domain. Domains of mGL50-2, hGL50, and Y08823 corresponding to those in mGL50-1 are presented 
in Figure 16. 

[0051] A protein alignment was made of the GL50 polypeptides, the published AB014553 sequence, and the human 
and mouse B7-1 and B7-2 sequences using the Geneworks protein alignment program with the parameters set at: cost 
to open gap = 5, cost to lengthen gap = 5, minimum diagonal length = 4, maximum diagonal offset =130, consensus 
cutoff= 50%, and using the Pam 250 matrix. The results of the alignment are presented below in Table 1. 

TABLE 1 



Protein Alignment for G150-related proteins 





AB014553 


hGL50 


mGL50-1 


mGL50-2 


hB7-2 


mB7-2 


hB7-1 


mB7-1 


ABOM553 


100 


59 


26 


28 


13 


13 


13 


7 


hGL50 




100 


42 


41 


17 


17 


17 


12 


GL50-1 






100 


92 


19 


19 


20 


14 


GL50-2 








100 


20 


21 


20 


13 


hB7-2 










100 


48 


19 


21 


mB7-2 












100 


20 


24 


hB7-1 














100 
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(continued) 

Protein Alignment for G150-related proteins 

AB014553 hGL50 mGL50-1 mGL50-2 hB7-2 mB7-2 hB7-1 mB7-1 
mB7-1 100 

Alignments were done using the Geneworks protein alignment program with the cost to open gap =5, 
cost to lengthen gap=5, min. diagonal length=4, max. diagonal offset= 130, consensus cutoff=50%, 
Pam 250 matrix. 



[0052] Table 1 shows that the hGL50 polypeptide has approximately 59% amino acid sequence identity with the 
polypeptide encoded by AB01 4553 and approximately 40% amino acid sequence identity with mGL50-1 and mGL50-2. 
mGL50-1 and mGL50-2 share a higher degree of amino acid sequence identity, approximately 92%. The GL50 polypep- 
tides share approximately 20% amino acid sequence identity with other B7 family molecules. 

[0053] Another alignment was made to determine the extent of relatedness between murine GL50, hGL50, human 
B7-1, mouse B7-1, mouse B7-2, and human B7-2 protein sequences. Using a Pileup analysis (Figure 12), 18 amino 
acid locations aligned identically between all six molecules within the extracellular domain. Of the 32 positions that define 
the predicted IgV-like and IgC-like folds of the B7 molecule, 1 3 are identically conserved between all six molecules, most 
notably the 4 cysteines that allow intramolecular folding of domains. Other areas of significant sequence conservation 
were also seen in the extracellular domain, but interestingly the identities of GL50 sequences in certain locations aligned 
more closely with either B7-1 or B7-2 (identity score of 8). For example, a valine residue corresponding to position 86 
of mGL50-1 is shared by hGL50, and B7-2 sequences, but not B7-1. Likewise, the tyrosine at position 87 of mouse 
mGL50-1 is conserved at corresponding locations in hGL50 and B7-1, but not B7-2. Of the 16 positions with identity 
scores of 8, 5 positions are shared by mouse mGL50-1/hGL50 and B7-1 , 4 positions are shared between mouse 
mGL50-1/hGL50 and B7-2, and 6 positions are shared between B7-1 and B7-2. Based on the peptide structure, these 
results suggest that the GL50 sequences occupy a phylogenetic space parallel to the B7 family of proteins. 
[0054] Molecular phylogeny analysis (GrowTree) measuring genetic distance in terms of substitutions per 100 amino 
acids resulted in a dendrogram (Figure 1 3) with independent clustering of mouse/hGL50 (85), m/hB7-2(68) and m/hB7- 
I (88). As an outgroup, mmu67065_1 (mouse butyrophilin) was used. The chicken clone Y08823 also was found to be 
more closely aligned with the GL50 sequences (-1 40) than the B7sequences (21 5-320), indicating that these sequences 
comprised a distinct subfamily of proteins. Distances between the GL50, B7-2 and B7-1 branches were high (21 6-284), 
suggesting that large numbers of substitutions have occurred between these molecules since the inception of the human 
and rodent lineage. The genetic distances among the GL50 nucleic acid molecules are presented below in Table 2. 

TABLE 2 



Genetic Distances among B7 family members 





hGL50 


mGL50-1 


Y08823 


hB7-2 


mB7-2 


hB7-1 


mB7-1 


mmu67065_1 


hGL50 


0 


85 


142 


284 


263 


226 


260 


188 


mGL50-1 
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139 


225 


216 


229 


257 


223 


Y08823 
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235 


322 


215 


223 


223 


hB7-2 
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68 


222 


190 


215 


mB7-2 










0 


88 


211 


21 


hB7-1 












0 


88 


211 


mB7-1 














0 


271 


mmu67065_1 
















0 



[0055] Various aspects of the invention are described in further detail in the following subsections: 
I. Definitions 

[0056] As used herein, the term "immune cell" includes cells that are of hematopoietic origin and that play a role in 
the immune response. Immune cells include lymphocytes, such as B cells and T cells; natural killer cells; myeloid cells, 
such as monocytes, macrophages, eosinophils, mast cells, basophils, and granulocytes. 

[0057] As used herein, the term "T cell" includes CD4+ T cells and CD8+ T cells. The term T cell also includes both 
T helper 1 type T cells and T helper 2 type T cells. The term "antigen presenting cell" includes professional antigen 
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presenting cells (e.g., B lymphocytes, monocytes, dendritic cells, Langerhans cells) as well as other antigen presenting 
cells (e.g., keratinocytes, endothelial cells, astrocytes, fibroblasts, oligodendrocytes). 

[0058] As used herein, the term "immune response" includes T cell mediated and/or B cell mediated immune responses 
that are influenced by modulation of T cell costimulation. Exemplary immune responses include T cell responses, e.g., 
cytokine production, and cellular cytotoxicity. In addition, the term immune response includes immune responses that 
are indirectly effected by T cell activation, e.g., antibody production (humoral responses) and activation of cytokine 
responsive cells, e.g., macrophages. 

[0059] As used herein, the term "costimulatory receptor" includes receptors which transmit a costimulatory signal to 
a immune cell, e.g., CD28. As used herein, the term "inhibitory receptors" includes receptors which transmit a negative 
signal to an immune cell (e.g., CTLA4). An inhibitory signal as transduced by an inhibitory receptor can occur even if a 
costimulatory receptor (such as CD28) in not present on the immune cell and, thus, is not simply a function of competition 
between inhibitory receptors and costimulatory receptors for binding of costimulatory molecules (Fallarino et al. (1998) 
J. Exp. Med. 188:205). Transmission of an inhibitory signal to an immune cell can result in unresponsiveness or anergy 
or programmed cell death in the immune cell. Preferably transmission of an inhibitory signal operates through a mech- 
anism that does not involve apoptosis. As used herein the term "apoptosis" includes programmed cell death which can 
be characterized using techniques which are known in the art. Apoptotic cell death can be characterized, e.g., by cell 
shrinkage, membrane blebbing and chromatin condensation culminating in cell fragmentation. Cells undergoing apop- 
tosis also display a characteristic pattern of internucleosomal DNA cleavage. 

[0060] I n addition to differences in types of receptors, different forms of costimulaotry molecules can be either activating 
or inhibitory. For example, in the case of an activating receptor a signal can be transmitted e.g., by a multivalent form 
of a costimulatory molecule that results in crosslinking of an activating receptor, or a signal can be inhibited, e.g., by a 
form of a costimulatory molecule that binds to an activating receptor, but fails to transmit an activating signal, e.g., by 
competing with activating forms of costimulatory molecules for binding to the receptor. (Certain soluble forms of costim- 
ulatory molecules can be inhibitory, however, there are instances in which a soluble molecule can be stimulatory). 
Similarly, depending upon the form of costimulatory molecule that binds to an inhibitory receptor, either a signal can be 
transmitted (e.g., by a multivalent form of a costimulatory molecule that results in crosslinking of an activating receptor) 
or a signal can be inhibited (e.g., by a form of a costimulatory molecule that binds to an inhibitory receptor, but fails to 
transmit an inhibitory signal). The effects of the various modulatory agents can be easily demonstrated using routine 
screening assays as described herein. 

[0061] As used herein, the term "costimulate" with reference to activated immune cells includes the ability of a "cos- 
timulatory molecule" to provide a second, non-activating receptor mediated signal (a "costimulatory signal") that induces 
proliferation or effector function. For example, a costimulatory signal can result in cytokine secretion, e.g., in a T cell 
that has received a T cell-receptor-mediated signal. Immune cells that have received a cell-receptor mediated signal, 
e.g., via an activating receptor are referred to herein as "activated immune cells." 

[0062] As used herein, the term "activating receptor" includes immune cell receptors that bind antigen, complexed 
antigen (e.g., in the context of MHC molecules), or bind to antibodies. Such activating receptors include T cell receptors 
(TCR), B cell receptors (BCR), cytokine receptors, LPS receptors, complement receptors, and Fc receptors. 
[0063] For example, T cell receptors are present on T cells and are associated with CD3 molecules. T cell receptors 
are stimulated by antigen in the context of MHC molecules (as well as by polyclonal T cell activating reagents). T cell 
activation via the TCR results in numerous changes, e.g., protein phosphorylation, membrane lipid changes, ion fluxes, 
cyclic nucleotide alterations, RNA transcription changes, protein synthesis changes, and cell volume changes. 
[0064] As used herein, the term "inhibitory signal" refers to a signal transmitted via an inhibitory receptor (e.g., CTLA4) 
on a immune cell. Such a signal antagonizes a signal via an activating receptor (e.g., via a TCR, CD3, BCR, or Fc 
molecule) and can result in, e.g., inhibition of second messenger generation; an inhibition of proliferation; an inhibition 
of effector function in the immune cell, e.g., reduced phagocytosis, reduced antibody production, reduced cellular cyto- 
toxicity, the failure of the immune cell to produce mediators, (such as cytokines (e.g., IL-2) and/or mediators of allergic 
responses); or the development of anergy. 

[0065] As used herein, the term "adjuvant" includes agents which potentiate the immune response to an antigen (e.g., 
a tumor-associated antigen). Adjuvants can be administered in conjunction with costimulatory molecules to additionally 
augment the immune response. 

[0066] As used herein, the term "monospecific" includes molecules which have only one specificity, i.e., they specifically 
bind to their cognate ligand, e.g., CD28, CTLA4, or ICOS on T cells. Such monospecific agents have not been engineered 
to include additional specificities and, thus, do not bind in a targeted manner to other cell surface molecules. As used 
herein the term "oligospecific" includes molecules having more than one specificity, e.g., having an additional specificity 
for a molecule other than afor their cognate ligand, e.g., a specificity for a cell surface molecule, such as a tumor 
associated antigen or a T cell receptor. As used herein, the term "bivalent" includes soluble costimulatory molecules 
that have two binding sites for interaction with their ligand per molecule. As used herein, the term "dimeric" includes 
forms that are present as homodimers, i.e., as a unit comprised of two identical subunits which are joined together, e.g., 
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by disulfide bonds. As used herein, the term "multimeric" includes soluble forms having more than two subunits. 
[0067] In another embodiment, an activating form of a GL50 molecule is a soluble GL50 molecule. As used herein, 
the term "soluble" includes molecules, e.g., costimulatory molecules, which are not cell associated. Soluble costimulatory 
molecules retain the function of the cell associated molecules from which they are derived, e.g., they are capable of 
binding to their cognate ligands on T cells and mediating signal transduction via a CD28 and/or CTLA4 molecule on a 
T cell, however, they are in soluble form, i.e., are not membrane bound. Preferably, the soluble compositions comprise 
an extracellular domain of a costimulatory molecule. 

[0068] Preferably, such a soluble form of a GL50 comprises at least a portion of the extracellular domain of a GL50 
molecule. As used herein, the term "extracellular domain of a GL50 molecule" includes a portion of a GL50 molecule 
which, in the cell-associated form of the GL50 molecule, is extracellular. Preferably, the extracellular domain is the 
extracellular domain of a human GL50 molecule. In one embodiment, a soluble costimulatory molecule comprises an 
extracellular domain of a GL50 molecule and further comprises a signal sequence. 

[0069] As used herein, the term "unresponsiveness" includes refractivity of immune cells to stimulation, e.g., stimulation 
via an activating receptor or a cytokine. Unresponsiveness can occur, e.g., because of exposure to immunosuppressants 
or exposure to high doses of antigen. As used herein, the term "anergy" or "tolerance" includes refractivity to activating 
receptor-mediated stimulation. Such refractivity is generally antigen-specific and persists after exposure to the tolerizing 
antigen has ceased. For example, anergy in T cells (as opposed to unresponsiveness) is characterized by lack of cytokine 
production, e.g., IL-2. T cell anergy occurs when T cells are exposed to antigen and receive a first signal (a T cell receptor 
or CD-3 mediated signal) in the absence of a second signal (a costimulatory signal). Under these conditions, reexposure 
of the cells to the same antigen (even if reexposure occurs in the presence of a costimulatory molecule) results in failure 
to produce cytokines and, thus, failure to proliferate. Anergic T cells can, however, mount responses to unrelated antigens 
and can proliferate if cultured with cytokines {e.g., IL-2). For example, T cell anergy can also be observed by the lack 
of IL-2 production by T lymphocytes as measured by ELISA or by a proliferation assay using an indicator cell line. 
Alternatively, a reporter gene construct can be used. For example, anergic T cells fail to initiate IL-2 gene transcription 
induced by a heterologous promoter under the control of the 5' IL-2 gene enhancer or by a multimer of the API sequence 
that can be found within the enhancer (Kang et al. (1 992) Science 257:1 1 34). 

[0070] The GL50 polypeptide and nucleic acid molecules comprise a family of molecules having certain conserved 
structural and functional features. The term "family" when referring to the protein and nucleic acid molecules of the 
invention is intended to mean two or more proteins or nucleic acid molecules having a common structural domain or 
motif and having sufficient amino acid or nucleotide sequence homology as defined herein. Such family members can 
be naturally or non-naturally occurring and can be from either the same or different species. For example, a family can 
contain a first protein of human origin, as well as other, distinct proteins of human origin or alternatively, can contain 
homologues of non-human origin. Members of a family may also have common functional characteristics. The GL50 
molecules described herein are members of a larger family of molecules, the B7 family of costimulatory molecules. The 
term "B7 family" or "B7 molecules" as used herein includes costimulatory molecules that share sequence homology with 
B7 polypeptides, e.g., with B7-1, B7-2, B7-3 (recognized by the antibody BB-1), and/or GL50. For example, as shown 
in Table 1 above, human B7-1 and human B7-2 share approximately 20% amino acid sequence identity. In addition, 
the B7 family of molecules share a common function, e.g., the ability to bind to a B7 family ligand (e.g., one or more of 
CD28, CTLA4, or ICOS) and/or ther ligands on immune cells and have the ability to inhibit or induce costimulation of 
immune cells. 

[0071] As used herein, the term "activity" with respect to a GL50 polypeptide includes activities which are inherent in 
the structure of a GL50 polypeptide. The term "activity" includes the ability to modulate a costimulatory signal in activated 
T cells and induce proliferation and/or cytokine secretion. In addition, the term "activity" includes the ability of a GL50 
polypeptide to bind its natural ligand or binding partner. Preferably, the ligand to which a GL50 polypeptide binds is an 
ICOS molecule. As used herein "activating forms" of costimulatory molecules transmit a signal via a costimulatory 
receptor (e.g., a signal which activates an immune cell if the receptor is an inhibitory receptor which transmits a costim- 
ulatory signal (e.g., CD28 or ICOS) or an inhibitory signal if the receptor is one which transmits a negative signal to an 
immune cell (e.g., CTLA4). Inhibitory forms of a costimulatory molecule prevent transmission of a signal to an immune 
cell (e.g., either a costimulatory signal or a negative signal). 

[0072] As used herein, the term "tumor" includes both benign and malignant (cancerous) neoplasias, (e.g., carcinomas, 
sarcomas, leukemias, and lymphomas). The term "cancer" includes primary malignant tumors (e.g., those whose cells 
have not migrated to sites in the subject's body other than the site of the original tumor) and secondary malignant tumors 
(e.g., those arising from metastasis, the migration of tumor cells to secondary sites that are different from the site of the 
original tumor). 

[0073] As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a 
nucleotide sequence that occurs in nature (e.g., encodes a natural protein). 

[0074] As used herein, an "antisense" nucleic acid molecule comprises a nucleotide sequence which is complementary 
to a "sense" nucleic acid molecule encoding a protein, e.g., complementary to the coding strand of a double-stranded 
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cDNA molecule, complementary to an mRNA sequence or complementary to the coding strand of a gene. Accordingly, 
an antisense nucleic acid molecule can hydrogen bond to a sense nucleic acid molecule. 

[0075] As used herein, the term "coding region" refers to regions of a nucleotide sequence comprising codons which 
are translated into amino acid residues, whereas the term "noncoding region" refers to regions of a nucleotide sequence 
that are not translated into amino acids (e.g., 5' and 3' untranslated regions). 

[0076] As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid 
to which it has been linked. One type of vector is a "plasmid", which refers to a circular double stranded DNA loop into 
which additional DNA segments maybe ligated. Another type of vector is a viral vector, wherein additional DNA segments 
may be ligated into the viral genome. Certain vectors are capable of autonomous replication in a host cell into which 
they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). 
Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction 
into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of 
directing the expression of genes to which they are operatively linked. Such vectors are referred to herein as "recombinant 
expression vectors" or simply "expression vectors". In general, expression vectors of utility in recombinant DNA tech- 
niques are often in the form of plasmids. In the present specification, "plasmid" and "vector" may be used interchangeably 
as the plasmid is the most commonly used form of vector. However, the invention is intended to include such other forms 
of expression vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated 
viruses), which serve equivalent functions. 

[0077] As used herein, the term "host cell" is intended to refer to a cell into which a nucleic acid of the invention, such 
as a recombinant expression vector of the invention, has been introduced. The terms "host cell" and "recombinant host 
cell" are used interchangeably herein. It should be understood that such terms refer not only to the particular subject 
cell but to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding 
generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent 
cell, but are still included within the scope of the term as used herein. 

[0078] As used herein, a "transgenic animal" refers to a non-human animal, preferably a mammal, more preferably a 
mouse, in which one or more of the cells of the animal includes a "transgene". The term "transgene" refers to exogenous 
DNA which is integrated into the genome of a cell from which a transgenic animal develops and which remains in the 
genome of the mature animal, for example directing the expression of an encoded gene product in one or more cell 
types or tissues of the transgenic animal. 

[0079] As used herein, a "homologous recombinant animal" refers to a type of transgenic non-human animal, preferably 
a mammal, more preferably a mouse, in which an endogenous gene has been altered by homologous recombination 
between the endogenous gene and an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic 
cell of the animal, prior to development of the animal. 

[0080] As used herein, an "isolated protein" refers to a protein that is substantially free of other proteins, cellular 
material and culture medium when isolated from cells or produced by recombinant DNA techniques, or chemical pre- 
cursors or other chemicals when chemically synthesized. 

[0081 ] The term "antibody" as used herein also includes an "antigen-binding portion" of an antibody (or simply "antibody 
portion"). The term "antigen-binding portion", as used herein, refers to one or more fragments of an antibody that retain 
the ability to specifically bind to an antigen (e.g. , GL50). It has been shown that the antigen-binding function of an antibody 
can be performed by fragments of a full-length antibody. Examples of binding fragments encompassed within the term 
"antigen-binding portion" of an antibody include (i) a Fab fragment, a monovalent fragment consisting of the VL, VH, CL 
and CH1 domains; (ii)a F(ab') 2 fragment, a bivalent fragment comprising two Fab fragments linked by a disulfide bridge 
at the hinge region; (iii) a Fd fragment consisting of the VH and CH 1 domains; (iv) a Fv fragment consisting of the VL 
and VH domains of a single arm of an antibody, (v) a dAb fragment (Ward et al., (1989) Nature 341 :544-546), which 
consists of a VH domain; and (vi) an isolated complementarity determining region (CDR). Furthermore, although the 
two domains of the Fv fragment, VL and VH, are coded for by separate genes, they can be joined, using recombinant 
methods, by a synthetic linker that enables them to be made as a single protein chain in which the VL and VH regions 
pair to form monovalent molecules (known as single chain Fv (scFv); see e.g., Bird et al. (1988) Science 242:423-426; 
and Huston et al. (1988) Proc. Natl. Acad. Sci. USA 85:5879-5883; and Osbourn et al. 1998, Nature Biotechnology 16: 
778). Such single chain antibodies are also intended to be encompassed within the term "antigen-binding portion" of an 
antibody. Any VH and VL sequences of specific scFv can be linked to human immunoglobulin constant region cDNA or 
genomic sequences, in order to generate expression vectors encoding complete IgG molecules or other isotypes. VH 
and VI can also be used in the generation of Fab , Fv or other fragments of immunoglobulins using either protein chemistry 
or recombinant DNA technology. Other forms of single chain antibodies, such as diabodies are also encompassed. 
Diabodies are bivalent, bispecific antibodies in which VH and VL domains are expressed on a single polypeptide chain, 
but using a linker that is too short to allow for pairing between the two domains on the same chain, thereby forcing the 
domains to pair with complementary domains of another chain and creating two antigen binding sites (see e.g., Holliger, 
P., et al. (1993) Proc. Natl. Acad. Sci. USA 90:6444-6448; Poljak, R. J., et al. (1994) Structure 2:1121-1123). 
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[0082] Still further, an antibody or antigen-binding portion thereof may be part of a larger immunoadhesion molecules, 
formed by covalent or noncovalent association of the antibody or antibody portion with one or more other proteins or 
peptides. Examples of such immunoadhesion molecules include use of the streptavidin core region to make a tetrameric 
scFv molecule (Kipriyanov, S.M., et al. (1995) Human Antibodies and Hybridomas 6:93-101) and use of a cysteine 
residue, a marker peptide and a C-terminal polyhistidine tag to make bivalent and biotinylated scFv molecules (Kipriyanov, 
S.M., et al. (1 994) Mol. Immunol. 31 :1 047-1 058). Antibody portions, such as Fab and F(ab') 2 fragments, can be prepared 
from whole antibodies using conventional techniques, such as papain or pepsin digestion, respectively, of whole anti- 
bodies. Moreover, antibodies, antibody portions and immunoadhesion molecules can be obtained using standard re- 
combinant DNA techniques, as described herein. 

[0083] Antibodies may be polyclonal or monoclonal; xenogeneic, allogeneic, or syngeneic; or modified forms thereof, 
e.g. humanized, chimeric, etc. Preferably, antibodies of the invention bind specifically or substantially specifically to 
GL50 molecules. The terms "monoclonal antibodies" and "monoclonal antibody composition", as used herein, refer to 
a population of antibody molecules that contain only one species of an antigen binding site capable of immunoreacting 
with a particular epitope of an antigen, whereas the term "polyclonal antibodies" and "polyclonal antibody composition" 
refer to a population of antibody molecules that contain multiple species of antigen binding sites capable of interacting 
with a particular antigen. A monoclonal antibody composition, typically displays a single binding affinity for a particular 
antigen with which it immunoreacts. 

[0084] The term "humanized antibody", as used herein, is intended to include antibodies made by a non-human cell 
having variable and constant regions which have been altered to more closely resemble antibodies that would be made 
by a human cell. For example, by altering the non-human antibody amino acid sequence to incorporate amino acids 
found in human germline immunoglobulin sequences. The humanized antibodies of the invention may include amino 
acid residues not encoded by human germline immunoglobulin sequences (e.g., mutations introduced by random or 
site-specific mutagenesis in vitro or by somatic mutation in vivo), for example in the CDRs. The term "humanized 
antibody", as used herein, also includes antibodies in which CDR sequences derived from the germline of another 
mammalian species, such as a mouse, have been grafted onto human framework sequences. 

[0085] An "isolated antibody", as used herein, is intended to refer to an antibody that is substantially free of other 
antibodies having different antigenic specificities {e.g., an isolated antibody that specifically binds GL50 is substantially 
free of antibodies that specifically bind antigens other than GL50). Moreover, an isolated antibody may be substantially 
free of other cellular material and/or chemicals. 

[0086] As used herein, "binding partner" is a target molecule or a molecule with which a GL50 polypeptide binds or 
interacts in nature (e.g., a ligand or an intracellular interactor molecule (such as a molecule that acts either upstream or 
downstream of GL50 in a signal transduction pathway)), such that a GL50 activity is achieved. 

[0087] The term "signal transduction" is intended to encompass the processing of physical or chemical signals from 
the extracellular environment through the cell membrane and into the cell, and may occur through one or more of several 
mechanisms, such as activation/inactivation of enzymes (such as proteases, or other enzymes which may alter phos- 
phorylation patterns or other post-translational modifications), activation of ion channels or intracellular ion stores, effector 
enzyme activation via guanine nucleotide binding protein intermediates, formation of inositol phosphate, activation or 
inactivation of adenylyl cyclase, direct activation (or inhibition) of a transcriptional factor and/or activation. A "signaling 
pathway" refers to the components involved in "signal transduction" of a particular signal into a cell. 
[0088] There is a known and definite correspondence between the amino acid sequence of a particular protein and 
the nucleotide sequences that can code for the protein, as defined by the genetic code (shown below). Likewise, there 
is a known and definite correspondence between the nucleotide sequence of a particular nucleic acid molecule and the 
amino acid sequence encoded by that nucleic acid molecule, as defined by the genetic code. 



GENETIC CODE 
Alanine (Ala, A) 
Arginine (Arg, R) 



GCA, GCC, GCG, GCT 

AGA, ACG, CGA, CGC, CGG, CGT 



Asparagine (Asn, N) 
Aspartic acid (Asp, D) 
Cysteine (Cys, C) 
Glutamic acid (Glu, E) 
Glutamine (Gin, Q) 



AAC, AAT 
GAC, GAT 
TGC, TGT 
GAA, GAG 
CAA, CAG 



Glycine (Gly, G) 
Histidine (His, H) 
Isoleucine (lie, I) 
Leucine (Leu, L) 



GGA, GGC, GGG, GGT 
CAC, CAT 
ATA, ATC, ATT 

CTA, CTC, CTG, CTT, TTA, TTG 
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(continued) 

GENETIC CODE 

Lysine (Lys, K) AAA, AAG 

Methionine (Met, M) ATG 

Phenylalanine (Phe, F) TTC, TTT 

Proline (Pro, P) CCA, CCC, CCG, CCT 

Serine (Ser, S) AGC, AGT, TCA, TCC, TCG, TCT 

Threonine (Thr, T) ACA, ACC, ACG, ACT 

Tryptophan (Trp, W) TGG 

Tyrosine (Tyr, Y) TAC, TAT 

Valine (Val, V) GTA, GTC, GTG, GTT 

Termination signal (end) TAA, TAG, TGA 



An important and well known feature of the genetic code is its redundancy, whereby, for most of the amino acids used 
to make proteins, more than one coding nucleotide triplet may be employed (illustrated above). Therefore, a number of 
different nucleotide sequences may code for a given amino acid sequence. Such nucleotide sequences are considered 
functionally equivalent since they result in the production of the same amino acid sequence in all organisms (although 
certain organisms may translate some sequences more efficiently than they do others). Moreover, occasionally, a meth- 
ylated variant of a purine or pyrimidine may be found in a given nucleotide sequence. Such methylations do not affect 
the coding relationship between the trinucleotide codon and the corresponding amino acid. 

[0089] In view of the foregoing, the nucleotide sequence of a DNA or RNA molecule coding for a GL50 polypeptide 
of the invention (or any portion thereof) can be used to derive the GL50 amino acid sequence, using the genetic code 
to translate the DNA or RNA molecule into an amino acid sequence. Likewise, for any GL50-amino acid sequence, 
corresponding nucleotide sequences that can encode GL50 polypeptide can be deduced from the genetic code (which, 
because of its redundancy, will produce multiple nucleic acid sequences for any given amino acid sequence). Thus, 
description and/or disclosure herein of a GL50 nucleotide sequence should be considered to also include description 
and/or disclosure of the amino acid sequence encoded by the nucleotide sequence. Similarly, description and/or dis- 
closure of a GL50 amino acid sequence herein should be considered to also include description and/or disclosure of all 
possible nucleotide sequences that can encode the amino acid sequence. 



II. Isolated Nucleic Acid Molecules 



[0090] One aspect of the invention pertains to isolated nucleic acid molecules that encode GL50 polypeptides or 
biologically active portions thereof, as well as nucleic acid fragments sufficient for use as hybridization probes to identify 
GL50-encoding nucleic acid molecules (e.g., GL50 mRNA) and fragments for use as PCR primers for the amplification 
or mutation of GL50 nucleic acid molecules. As used herein, the term "nucleic acid molecule" is intended to include DNA 
molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated 
using nucleotide analogs. The nucleic acid molecule can be single-stranded or double-stranded, but preferably is double- 
stranded DNA. 

[0091] An "isolated" nucleic acid molecule is one which is separated from other nucleic acid molecules which are 
present in the natural source of the nucleic acid. For example, with regards to genomic DNA, the term "isolated" includes 
nucleic acid molecules which are separated from the chromosome with which the genomic DNA is naturally associated. 
Preferably, an "isolated" nucleic acid molecule is free of sequences which naturally flank the nucleic acid (i.e., sequences 
located at the 5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid molecule 
is derived. For example, in various embodiments, the isolated GL50 nucleic acid molecule can contain less than about 
5 kb, 4kb, 3kb, 2kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in 
genomic DNA of the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic acid molecule, such as 
a cDNA molecule, can be substantially free of other cellular material, or culture medium when produced by recombinant 
techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized. An "isolated" 
GL50 nucleic acid molecule may, however, be linked to other nucleotide sequences that do not normally flank the GL50 
sequences in genomic DNA (e.g., the GL50 nucleotide sequences may be linked to vector sequences). In certain 
preferred embodiments, an "isolated" nucleic acid molecule, such as a cDNA molecule, also may be free of other cellular 
material. However, it is not necessary for the GL50 nucleic acid molecule to be free of other cellular material to be 
considered "isolated" (e.g., a GL50 DNA molecule separated from other mammalian DNA and inserted into a bacterial 
cell would still be considered to be "isolated"). 

[0092] A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule having the nucleotide sequence 
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of SEQ ID NO: 1,3, or 5, or a portion thereof, can be isolated using standard molecular biology techniques and the 
sequence information provided herein. For example, using all or portion of the nucleic acid sequence of SEQ ID NO:1, 
3, or 5, as a hybridization probe, GL50 nucleic acid molecules can be isolated using standard hybridization and cloning 
techniques (e.g., as described in Sambrook, J. et al. Molecular Cloning: A Laboratory Manual, 2nd, ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989). 

[0093] Moreover, a nucleic acid molecule encompassing all or a portion of SEQ ID NO:1 , 3, or 5 can be isolated by 
the polymerase chain reaction (PCR) using synthetic oligonucleotide primers designed based upon the sequence of 
SEQ ID NO:1 , 3, or 5, respectively. 

[0094] A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, genomic DNA, as a template 
and appropriate oligonucleotide primers according to standard PCR amplification techniques. The nucleic acid so am- 
plified can be cloned into an appropriate vector and characterized by DNA sequence analysis. Furthermore, oligonucle- 
otides corresponding to GL50 nucleotide sequences can be prepared by standard synthetic techniques, e.g., using an 
automated DNA synthesizer. 

[0095] In a preferred embodiment, an isolated nucleic acid molecule of the invention comprises the nucleotide sequence 
shown in SEQ ID NO:1 , 3, or 5. 

[0096] In one embodiment, an isolated nucleic acid molecule of the invention comprises a nucleic acid molecule which 
is a complement of the nucleotide sequence shown in SEQ ID NO: 1 , 3, or 5, or a portion of any of these nucleotide 
sequences. A nucleic acid molecule which is complementary to the nucleotide sequence shown in SEQ ID NO: 1 , 3, or 
5, is one which is sufficiently complementary to the nucleotide sequence shown in SEQ ID NO:1 ,3, or 5, respectively, 
such that it can hybridize to the nucleotide sequence shown in SEQ ID NO:1, 3, or 5, respectively, thereby forming a 
stable duplex. 

[0097] In still another preferred embodiment, an isolated nucleic acid molecule of the present invention comprises a 
nucleotide sequence which is at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or more 
homologous to the nucleotide sequence {e.g., to the entire length of the nucleotide sequence) shown in SEQ ID NO:1, 
3, or 5, or a portion of any of these nucleotide sequences. 

[0098] Moreover, the nucleic acid molecule of the invention can comprise only a portion of the nucleic acid sequence 
of SEQ ID NO:1, 3, or 5, for example a fragment which can be used as a probe or primer or a fragment encoding a 
biologically active portion of a GL50 polypeptide. The nucleotide sequence determined from the cloning of the GL50 
genes allows for the generation of probes and primers designed for use in identifying and/or cloning other GL50 family 
members, as well as GL50 family homologues from other species. The probe/primer typically comprises a substantially 
purified oligonucleotide. In one embodiment, the oligonucleotide comprises a region of nucleotide sequence that hybrid- 
izes under stringent conditions to at least about 12 or 15, preferably about 20 or 25, more preferably about 30, 35, 40, 
45, 50, 55, 60, 65, 75, or 100 consecutive nucleotides of a sense sequence of SEQ ID NO:1, 3, or 5, or of a naturally 
occurring allelic variant or mutant of SEQ ID NO:1 , 3, or 5. In another embodiment, a nucleic acid molecule of the present 
invention comprises a nucleotide sequence which is at least about 400, 450, 500, 550, 600, 650, 700, 750, 800, 900, 
1000, or 1100 nucleotides in length and hybridizes under stringent hybridization conditions to a nucleic acid molecule 
of SEQ ID NO:1, 3, or 5. 

[0099] In another embodiment, a nucleic acid molecule of the invention comprises at least about 1 00, 200, 300, 400, 
500, 600, 700, 800, 900, 1 000, or 1 1 00 contiguous nucleotides of SEQ ID NO:1 , 3, or 5. 

[0100] In one embodiment, a nucleic acid molecule of the invention, e.g., for use as a probe, does not include the 
portion of SEQ ID NO:1 from about nucleotides 1-370 of SEQ ID NO:5. 

[0101] Preferably, an isolated nucleic acid molecule of the invention comprises at least a portion of the coding region 
of SEQ ID NO:1 (shown in nucleotides 67-1 032) or SEQ ID NO:3 (shown in nucleotides 1-1 041 ) or SEQ ID NO:5 (shown 
in nucleotides 24-950). In another embodiment, a nucleic acid molecule of the invention comprises the entire coding 
region of SEQ ID NO:1, 3, or 5. 

[0102] In other embodiments, a nucleic acid molecule of the invention has at least 70% identity, more preferably 80% 
identity, and even more preferably 90% identity with a nucleic acid molecule comprising: at least about 300, 400, 500, 
600, 700, 800, or at about 900 nucleotides of SEQ I D NO:1 , 3, or 5, or at least about 1 000 or 1 1 00 contiguous nucleotides 
of SEQ ID NO:1 or 3. 

[0103] Probes based on the GL50 nucleotide sequences can be used to detect transcripts or genomic sequences 
encoding the same or homologous proteins. In preferred embodiments, the probe further comprises a label group 
attached thereto, e.g., the label group can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme cofactor. 
Such probes can be used as a part of a diagnostic test kit for identifying cells or tissues which misexpress a GL50 
polypeptide, such as by measuring a level of a GL50-encoding nucleic acid in a sample of cells from a subject e.g., 
detecting GL50 mRNA levels or determining whether a genomic GL50 gene has been mutated or deleted. 
[0104] A nucleic acid fragment encoding a "biologically active portion of a GL50 polypeptide" can be prepared by 
isolating a portion of the nucleotide sequence of SEQ ID NO:1, 3, or 5, which encodes a polypeptide having a GL50 
biological activity (the biological activities of the GL50 polypeptides are described herein), expressing the encoded portion 
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of the GL50 polypeptide (e.g., by recombinant expression in vitro) and assessing the activity of the encoded portion of 
the GL50 polypeptide. 

[0105] Nucleic acid molecules that differ from SEQ ID NO: 1 , 3, or 5 due to degeneracy of the genetic code, and thus 
encode the same a GL50 member protein as that encoded by SEQ ID NO:1 , 3, or 5 are encompassed by the invention. 
Accordingly, in another embodiment, an isolated nucleic acid molecule of the invention has a nucleotide sequence 
encoding a protein having an amino acid sequence shown in SEQ ID NO:2, 4 or 6. In another embodiment, an isolated 
nucleic acid molecule of the invention has a nucleotide sequence encoding a GL50 polypeptide. 
[0106] In addition to the GL50 nucleotide sequences shown in SEQ ID NO:1 , 3, or 5, it will be appreciated by those 
skilled in the art that DNA sequence polymorphisms that lead to changes in the amino acid sequences of the GL50 
polypeptides may exist within a population (e.g., the human population). Such genetic polymorphism in the GL50 genes 
may exist among individuals within a population due to natural allelic variation. As used herein, the terms "gene" and 
"recombinant gene" refer to nucleic acid molecules which include an open reading frame encoding a GL50 polypeptide, 
preferably a mammalian GL50 polypeptide, and can further include non-coding regulatory sequences, and introns. Such 
natural allelic variations include both functional and non-functional GL50 polypeptides and can typically result in 1-5% 
variance in the nucleotide sequence of a GL50 gene. Any and all such nucleotide variations and resulting amino acid 
polymorphisms in GL50 genes that are the result of natural allelic variation and that do not alter the functional activity 
of a GL50 polypeptide are intended to be within the scope of the invention. 

[0107] Moreover, nucleic acid molecules encoding other GL50 family members and, thus, which have a nucleotide 
sequence which differs from the GL50 family sequences of SEQ ID NO:1 , 3, or 5 are intended to be within the scope of 
the invention. For example, another mGL50-1 can be identified based on the nucleotide sequence of hGL50. Moreover, 
nucleic acid molecules encoding GL50 polypeptides from different species, and thus which have a nucleotide sequence 
which differs from the GL50 sequences of SEQ ID NO:1 ,3, or 5 are intended to be within the scope of the invention. For 
example, an ortholog of the mGL50-1 can be identified based on the murine nucleotide sequence. 
[01 08] Nucleic acid molecules corresponding to natural allelic variants and homologues of the GL50 molecules of the 
invention can be isolated, e.g., based on their homology to the GL50 nucleic acids disclosed herein using the cDNAs 
disclosed herein, or portions thereof, as hybridization probes according to standard hybridization techniques. For ex- 
ample, a GL50 DNA can be isolated from a human genomic DNA library using all or portion of SEQ ID NO:1 , 3, or 5 as 
a hybridization probe and standard hybridization techniques (e.g., as described in Sambrook, J., et al. Molecular Cloning: 
A Laboratory Manual. 2nd, ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989). Moreover, a nucleic 
acid molecule encompassing all or a portion of a GL50 gene can be isolated by the polymerase chain reaction using 
oligonucleotide primers designed based upon the sequence of SEQ ID NO:1,3,or5. For example, mRNAcan be isolated 
from cells (e.g., by the guanidinium-thiocyanate extraction procedure of Chirgwin et al. (1979) Biochemistry 18: 
5294-5299) and cDNA can be prepared using reverse transcriptase (e.g., Moloney MLV reverse transcriptase, available 
from Gibco/BRL, Bethesda, MD; or AMV reverse transcriptase, available from Seikagaku America, Inc., St. Petersburg, 
FL). Synthetic oligonucleotide primers for PCR amplification can be designed based upon the nucleotide sequence 
shown in SEQ ID NO:1, 3, or 5. A nucleic acid molecule of the invention can be amplified using cDNA or, alternatively, 
genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR amplification tech- 
niques. The nucleic acid so amplified can be cloned into an appropriate vector and characterized by DNA sequence 
analysis. Furthermore, oligonucleotides corresponding to a GL50 nucleotide sequence can be prepared by standard 
synthetic techniques, e.g., using an automated DNA synthesizer. 

[0109] In another embodiment, an isolated nucleic acid molecule of the invention can be identified based on shared 
nucleotide sequence identity using a mathematical algorithm. Such algorithms are outlined in more detail below (see, 
e.g., section III). 

[0110] In another embodiment, an isolated nucleic acid molecule of the invention is at least 15, 20, 25, 30 or more 
nucleotides in length and hybridizes under stringent conditions to the nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NO:1 , 3, or 5. In other embodiment, the nucleic acid molecule is at least 30, 50, 100, 150, 200, 
250, 300, 350, 400, 450, 500, 550, or 600 nucleotides in length. As used herein, the term "hybridizes under stringent 
conditions" is intended to describe conditions for hybridization and washing under which nucleotide sequences at least 
30%, 40%, 50%, or 60% homologous to each other typically remain hybridized to each other. Preferably, the conditions 
are such that sequences at least about 70%, more preferably at least about 80%, even more preferably at least about 
85% or 90% homologous to each other typically remain hybridized to each other. Such stringent conditions are known 
to those skilled in the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3. 1 -6.3.6. A preferred, non-limiting example of stringent hybridization conditions are hybridization in 6X sodium chloride/ 
sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2 X SSC, 0.1% SDS at 50-65°C. Preferably, 
an isolated nucleic acid molecule of the invention that hybridizes under stringent conditions to the sequence of SEQ ID 
NO:1 , 3, or 5 corresponds to a naturally-occurring nucleic acid molecule. 

[0111] As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a 
nucleotide sequence that occurs in nature (e.g., encodes a natural protein). In addition to the GL50 nucleotide sequences 
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shown in SEQ ID NO:1 , 3, or 5 it will be appreciated by those skilled in the art that DNA sequence polymorphisms that 
lead to minor changes in the nucleotide or amino acid sequences of a GL50 may exist within a population. Such genetic 
polymorphism in a GL50 gene may exist among individuals within a population due to natural allelic variation. Such 
natural allelic variations can typically result in 1-2 % variance in the nucleotide sequence of the gene. Such nucleotide 
variations and resulting amino acid polymorphisms in a GL50 that are the result of natural allelic variation and that do 
not alter the functional activity of a GL50 polypeptide are within the scope of the invention. 

[0112] In addition to naturally-occurring allelic variants of GL50 sequences that may exist in the population, the skilled 
artisan will further appreciate that minor changes may be introduced by mutation into nucleotide sequences, e.g., of 
SEQ ID NO: 1 , 3, or 5, thereby leading to changes in the amino acid sequence of the encoded protein, without altering 
the functional activity of a GL50 polypeptide. For example, nucleotide substitutions leading to amino acid substitutions 
at "non-essential" amino acid residues may be made in the sequence of SEQ ID NO:1 , 3, or 5. A "non-essential" amino 
acid residue is a residue that can be altered from the wild-type sequence of a GL50 nucleic acid molecule (e.g., the 
sequence of SEQ ID NO:1 , 3, or 5) without altering the functional activity of a GL50 molecule. Exemplary residues which 
are non-essential and, therefore, amenable to substitution, can be identified by one of ordinary skill in the art by performing 
an amino acid alignment of B7 family members (or of GL50 family members) and determining residues that are not 
conserved. Such residues, because they have not been conserved, are more likely amenable to substitution. 
[0113] Accordingly, another aspect of the invention pertains to nucleic acid molecules encoding GL50 polypeptides 
that contain changes in amino acid residues that are not essential for a GL50 activity. .Such GL50 polypeptides differ in 
amino acid sequence from SEQ ID NO:2, 4, or 6 yet retain an inherent GL50 activity. An isolated nucleic acid molecule 
encoding a non-natural variant of a GL50 polypeptide can be created by introducing one or more nucleotide substitutions, 
additions or deletions into the nucleotide sequence of SEQ ID NO:1 , 3, orSsuch thatoneor more amino acid substitutions, 
additions or deletions are introduced into the encoded protein. Mutations can be introduced into SEQ ID NO:1 , 3, or 5 
by standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative 
amino acid substitutions are made at one or more non-essential amino acid residues. A "conservative amino acid 
substitution" is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. 
Families of amino acid residues having similar side chains have been defined in the art, including basic side chains (e.g., 
lysine, arginine, histidine), acidic side chains {e.g., aspartic acid, glutamic acid), uncharged polar side chains {e.g., 
glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains {e.g., alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains {e.g., threonine, valine, isoleu- 
cine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a nonessential amino acid 
residue in a GL50 is preferably replaced with another amino acid residue from the same side chain family. 
[01 1 4] Alternatively, in another embodiment, mutations can be introduced randomly along all or part of a GL50 coding 
sequence, such as by saturation mutagenesis or rational cassette mutagenesis, and the resultant mutants can be 
screened for their ability to bind to a ligand, or to bind to intracellular interactor molecules to identify mutants that retain 
functional activity. Following mutagenesis, the encoded GL50 mutant protein can be expressed recombinantly in a host 
cell and the functional activity of the mutant protein can be determined using assays available in the art for assessing 
a GL50 activity. 

[0115] Accordingly, another aspect of the invention pertains to nucleic acid molecules encoding GL50 polypeptides 
that contain changes in amino acid residues that are not essential for activity. Homology alignments, such as the pile- 
up analysis shown herein, can be used to select amino acids which may be amenable to alteration. For example, the 
18 amino acid locations which aligned identically between all six molecules within the extracellular domain are well 
conserved and are, therefore, less likely to be amenable to alteration. Similarly, of the 32 positions that define the 
predicted IgV-like and IgC-like folds of the B7 family molecules, 1 3 are identically conserved between all six molecules, 
most notably the 4 cysteines that allow intramolecular folding of domains. Therefore, these amino acids are unlikely to 
be amenable to alteration. Other areas of significant sequence conservation were also seen in the extracellular domain. 
For example, valine residue corresponding to position 86 of mGL50-1 is shared by hGL50, and B7-2 sequences may 
not be amenable to alteration. Likewise, the tyrosine at position 87 of mouse mGL50-1 which is conserved at corre- 
sponding locations in hGL50 and B7-1. The 16 positions with identity scores of 8 (5 positions are shared by mouse 
mGL50-1/hGL50 and B7-1 , 4 positions shared between mouse mGL50-1/hGL50 and B7-2, and 6 positions are shared 
between B7-1 and B7-2) may not be amenable to alteration. In addition, positions in the transmembrane and/or cyto- 
plasmic domains conserved among the GL50 family members (in particular tyrosind residues in the transmembrane or 
cytoplasmic domain of a GL50 molecule). Again, these positions are unlikely to be amenable to alteration if GL50 activity 
is to be maintained. 

[01 1 6] Yet another aspect of the invention pertains to non-naturally occurring GL50 molecules nucleic acid molecules 
which are chimeric in that they comprise a nucleic acid sequence encoding GL50 transmembrane or cytoplasmic domain 
which they do not naturally comprise. For example, in one embodiment, transmembrane and/or cytoplasmic domains 
of a GL50 domain can be "swapped" or "shuffled" using standard molecular biology techniques to create GL50 molecules 
that have altered signal transduction properties as compared to a naturally occurring GL50 molecule. Such nucleic acid 
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and polypeptide molecules are also embraced by the invention. 

[0117] In yet another aspect, GL50 nucleic acid molecules can be engineered to comprise nucleic acid sequences 
encoding at least a portion of another B7 family member, e.g., B7-1 or B7-2. For example, using standard techniques, 
nucleic acid molecules can be made that encode hybrid GL50/B7 molecules with ligand binding and/or signaling properties 
that differ from those seen in naturally occurring molecules. For example, in one embodiment, the sequence of chicken 
GL50 (Y08823) can be used to design molecules with altered signaling and/or binding properties. The sequence similarity 
between avian GL50 and mammalian forms of the molecule and their difference in ligand preference can be exploited 
to this end. For instance, progressive substitution of residues conserved between avian GL50-like protein (Y08823) and 
GL50 with those found in GL50 (to make the molecule more GL50-like) may result in a functional molecule that binds 
to ICOS and CD28 and CTLA4. Ig-fusion or other constructs comprising huybrid GL50/B7 proteins can be used to 
achieve differential activation or inhibition of target cell populations and skewing of T cell phenotypes. Such nucleic acid 
and polypeptide molecules are also embraced by the invention. 

[0118] Yet another aspect of the invention pertains to isolated nucleic acid molecules encoding a GL50 fusion proteins. 
Such nucleic acid molecules, comprising at least a first nucleotide sequence encoding a GL50 polypeptide, polypeptide 
or peptide operatively linked to a second nucleotide sequence encoding a non- GL50 polypeptide, polypeptide or peptide, 
can be prepared by standard recombinant DNA techniques. 

[0119] In a preferred embodiment, a mutant GL50 polypeptide can be assayed for the ability to: 1) costimulate (or 
inhibit the costimulation of, e.g., in soluble form) the proliferation and/or effector function (e.g., cytokine secretion (such 
as, for example IL-2 or IL-10) of activated T cells; 2) bind to an anti-B7 antibody; and/or 3) bind to a GL50 ligand (e.g., 
to CD28, CTLA4, and/or ICOS). 

[0120] In addition to the nucleic acid molecules encoding GL50 polypeptides described above, another aspect of the 
invention pertains to isolated nucleic acid molecules which are antisense thereto. An "antisense" nucleic acid molecule 
comprises a nucleotide sequence which is complementary to a "sense" nucleic acid molecule encoding a protein, e.g., 
complementary to the coding strand of a double-stranded cDNA molecule or complementary to an mRNA sequence. 
Accordingly, an antisense nucleic acid molecule can hydrogen bond to a sense nucleic acid molecule. The antisense 
nucleic acid molecule can be complementary to an entire GL50 coding strand, or only to a portion thereof. In one 
embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of the coding strand of a nucleotide 
sequence encoding GL50. The term "coding region" refers to the region of the nucleotide sequence comprising codons 
which are translated into amino acid residues. In another embodiment, the antisense nucleic acid molecule is antisense 
to a "noncoding region" of the coding strand of a nucleotide sequence encoding GL50. The term "noncoding region" 
refers to 5' and 3' sequences which flank the coding region that are not translated into amino acids (i.e., also referred 
to as 5' and 3' untranslated regions). 

[0121] Given the coding strand sequences encoding GL50 disclosed herein, antisense nucleic acid molecules of the 
invention can be designed according to the rules of Watson and Crick base pairing. The antisense nucleic acid molecule 
can be complementary to the entire coding region of GL50 mRNA, but more preferably is an oligonucleotide which is 
antisense to only a portion of the coding or noncoding region of GL50 mRNA. For example, the antisense oligonucleotide 
can be complementary to the region surrounding the translation start site of GL50 mRNA. An antisense oligonucleotide 
can be, for example, about 5, 1 0, 1 5, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid molecule 
of the invention can be constructed using chemical synthesis and enzymatic ligation reactions using procedures known 
in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously modified nucleotides designed to increase the biological stability of 
the molecules or to increase the physical stability of the duplex formed between the antisense and sense nucleic acids, 
e.g., phosphorothioate derivatives and acridine substituted nucleotides can be used. Examples of modified nucleotides 
which can be used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodou- 
racil, hypoxanthine, xantine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiou- 
ridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 
1 -methylguanine, 1 -methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-meth- 
ylcytosine; N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 
mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5- 
oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 
5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2- 
carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced bio- 
logically using an expression vector into which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA 
transcribed from the inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described 
further in the following subsection). 

[0122] The antisense nucleic acid molecules of the invention are typically administered to a subject or generated in 
situ such that they hybridize with or bind to cellular mRNA and/or genomic DNA encoding a GL50 polypeptide to thereby 
inhibit expression of the protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by conven- 
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tional nucleotide complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic acid 
molecule which binds to DNA duplexes, through specific interactions in the major groove of the double helix. An example 
of a route of administration of antisense nucleic acid molecules of the invention includes direct injection at a tissue site. 
Alternatively, antisense nucleic acid molecules can be modified to target selected cells and then administered system- 
ically. For example, for systemic administration, antisense molecules can be modified such that they specifically bind to 
receptors or antigens expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules to 
peptides or antibodies which bind to cell surface receptors or antigens. The antisense nucleic acid molecules can also 
be delivered to cells using the vectors described herein. To achieve sufficient intracellular concentrations of the antisense 
molecules, vector constructs in which the antisense nucleic acid molecule is placed under the control of a strong pol II 
or pol III promoter are preferred. 

[0123] In yet another embodiment, the antisense nucleic acid molecule of the invention is an a-anomeric nucleic acid 
molecule. An a-anomeric nucleic acid molecule forms specific double-stranded hybrids with complementary RNA in 
which, contrary to the usual p-units, the strands run parallel to each other (Gaultier et al. (1 987) Nucleic Acids. Res. 1 5: 
6625-6641). The antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide (Inoue et al. (1987) 
Nucleic Acids Res. 15:6131-6148) or a chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS Lett. 215:327-330). 
[0124] In still another embodiment, an antisense nucleic acid molecule of the invention is a ribozyme. Ribozymes are 
catalytic RNA molecules with ribonuclease activity which are capable of cleaving a single-stranded nucleic acid, such 
as an mRNA, to which they have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes (described 
in Haseloff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave GL50 mRNA transcripts to 
thereby inhibit translation of GL50 mRNA. A ribozyme having specificity for a GL50-encoding nucleic acid can be designed 
based upon the nucleotide sequence of a GL50 disclosed herein (e.g., SEQ ID NO:1 , 3, or 5). For example, a derivative 
of a TetrahymenaL^ 9 IVS RNA can be constructed in which the nucleotide sequence of the active site is complementary 
to the nucleotide sequence to be cleaved in a GL50-encoding mRNA. See, e.g., Cech et al. U.S. Patent No. 4,987,071 ; 
and Cech et al. U.S. Patent No. 5,1 16,742. Alternatively, GL50 mRNA can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel, D. and Szostak, J. W. (1993) Science 
261:1411-1418. 

[0125] Alternatively, GL50 gene expression can be inhibited by targeting nucleotide sequences complementary to the 
regulatory region of the GL50 (e.g., the GL50 promoter and/or enhancers) to form triple helical structures that prevent 
transcription of the GL50 gene in target cells. See generally, Helene, C. (1991) Anticancer Drug Des. 6(6):569-84; 
Helene, C. et al. (1992) Ann. N.Y. Acad. Sci. 660:27-36; and Maher, L.J. (1992) Bioessays 1 4(1 2):807-15. 
[0126] In yet another embodiment, the GL50 nucleic acid molecules of the present invention can be modified at the 
base moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or solubility of the molecule. 
For example, the deoxyribose phosphate backbone of the nucleic acid molecules can be modified to generate peptide 
nucleic acids (see Hyrup B. and Nielsen, P. E. (1 996) Bioorg. Med. Chem. 4(1 ):5-23). As used herein, the terms "peptide 
nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in which the deoxyribose phosphate backbone 
is replaced by a pseudopeptide backbone and only the four natural nucleobases are retained. The neutral backbone of 
PNAs has been shown to allow for specific hybridization to DNA and RNA under conditions af low ionic strength. The 
synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis protocols as described in 
Hyrup and Nielsen (1996) supra; Perry-O'Keefe et al. (1996) Proc. Natl. Acad. Sci. USA 93:14670-675. 
[0127] PNAs of GL50 nucleic acid molecules can be used in therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene expression by, for example, 
inducing transcription or translation arrest or inhibiting replication. PNAs of GL50 nucleic acid molecules can also be 
used in the analysis of single base pair mutations in a gene, (e.g., by PNA-directed PCR clamping); as 'artificial restriction 
enzymes' when used in combination with other enzymes, (e.g., S 1 nucleases (Hyrup and Nielsen (1 996) supra)); ox as 
probes or primers for DNA sequencing or hybridization (Hyrup and Nielsen (1 996) supra; Perry-O'Keefe (1 996) supra). 
[0128] In another embodiment, PNAs of GL50 can be modified, (e.g., to enhance their stability or cellular uptake), by 
attaching lipophilic or other helper groups to PNA, by the formation of PNA-DNA chimeras, or by the use of liposomes 
or other techniques of drug delivery known in the art. For example, PNA-DNA chimeras of GL50 nucleic acid molecules 
can be generated which may combine the advantageous properties of PNA and DNA. Such chimeras allow DNA rec- 
ognition enzymes, (e.g., RNAse H and DNA polymerases), to interact with the DNA portion while the PNA portion would 
provide high binding affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate lengths selected 
in terms of base stacking, number of bonds between the nucleobases, and orientation (Hyrup and Nielsen (1 996) supra). 
The synthesis of PNA-DNA chimeras can be performed as described in Hyrup and Nielsen (1 996) supra and Finn P. J. 
et al. (1 996) Nucleic Acids Res. 24(1 7):3357-63. For example, a DNA chain can be synthesized on a solid support using 
standard phosphoramidite coupling chemistry and modified nucleoside analogs, e.g., 5'-(4-methoxytrityl)amino-5'-de- 
oxythymidine phosphoramidite, can be used as a between the PNA and the 5' end of DNA (Mag, M. et al. (1 989) Nucleic 
Acids Res. 17:5973-88). PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule with 
a 5' PNA segment and a 3' DNA segment (Finn et al. (1 996) supra). Alternatively, chimeric molecules can be synthesized 
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with a 5' DNA segment and a 3' PNA segment (Peterser, K. H. etal. (1975) Bioorganic Med. Chem. Lett. 5:1 119-1 1 124). 
[0129] In other embodiments, the oligonucleotide may include other appended groups such as peptides (e.g., for 
targeting host cell receptors in vivo), or agents facilitating transport across the cell membrane (see, e.g., Letsinger et 
al. (1989) Proc. Natl. Acad. Sci. USA 86:6553-6556; Lemaitre et al. (1987) Proc. Natl. Acad Sci. USA 84:648-652; PCT 
Publication No. W088/09810) or the blood-brain barrier (see, e.g., PCT Publication No. W089/10134). In addition, oli- 
gonucleotides can be modified with hybridization-triggered cleavage agents (See, e.g., Krol et al. (1988) Biotechniques 
6:958-976) or intercalating agents. (See, e.g., Zon (1 988) Pharm. Res. 5:539-549). To this end, the oligonucleotide may 
be conjugated to another molecule, (e.g., a peptide, hybridization triggered cross-linking agent, transport agent, or 
hybridization-triggered cleavage agent). 

III. Isolated GL50 polypeptides and Anti-GL50 Antibodies 

[0130] One aspect of the invention pertains to isolated GL50 polypeptides, and biologically active portions thereof, as 
well as polypeptide fragments suitable for use as immunogens to raise anti-GL50 antibodies. In one embodiment, native 
GL50 polypeptides can be isolated from cells or tissue sources by an appropriate purification scheme using standard 
protein purification techniques. In another embodiment, GL50 polypeptides are produced by recombinant DNA tech- 
niques. Alternative to recombinant expression, a GL50 polypeptide or polypeptide can be synthesized chemically using 
standard peptide synthesis techniques. 

[0131] An "isolated" or "purified" protein or biologically active portion thereof is substantially free of cellular material 
or other contaminating proteins from the cell or tissue source from which the GL50 polypeptide is derived, or substantially 
free from chemical precursors or other chemicals when chemically synthesized. The language "substantially free of 
cellular material" includes preparations of GL50 polypeptide in which the protein is separated from cellular components 
of the cells from which it is isolated or recombinantly produced. In one embodiment, the language "substantially free of 
cellular material" includes preparations of GL50 polypeptide having less than about 30% (by dry weight) of non-GL50 
polypeptide (also referred to herein as a "contaminating protein"), more preferably less than about 20% of non-GL50 
polypeptide, still more preferably less than about 10% of non-GL50 polypeptide, and most preferably less than about 
5% non-GL50 polypeptide. When the GL50 polypeptide or biologically active portion thereof is recombinantly produced, 
it is also preferably substantially free of culture medium, i.e., culture medium represents less than about 20%, more 
preferably less than about 1 0%, and most preferably less than about 5% of the volume of the protein preparation. 
[0132] The language "substantially free of chemical precursors or other chemicals" includes preparations of GL50 
polypeptide in which the protein is separated from chemical precursors or other chemicals which are involved in the 
synthesis of the protein. In one embodiment, the language "substantially free of chemical precursors or other chemicals" 
includes preparations of GL50 polypeptide having less than about 30% (by dry weight) of chemical precursors or non- 
GL50 chemicals, more preferably less than about 20% chemical precursors or non-GL50 chemicals, still more preferably 
less than about 10% chemical precursors or non-GL50 chemicals, and most preferably less than about 5% chemical 
precursors or non-GL50 chemicals. 

[0133] Another aspect of the invention pertains to isolated GL50 polypeptides. Preferably, the GL50 polypeptides 
comprise the amino acid sequence encoded by SEQ ID NO: 1, 3, or 5. In another preferred embodiment, the protein 
comprises the amino acid sequence of SEQ ID NO:2, 4, or 6. In other embodiments, the protein has at least 50%, at 
least 60 % amino acid identity, more preferably 70% amino acid identity, more preferably 80%, and even more preferably, 
90% or 95% amino acid identity with the amino acid sequence shown in SEQ ID NO:2, 4, or 6. 

[0134] In other embodiments, the invention provides isolated portions of a GL50 polypeptide. GL50 polypeptides 
comprising a GL50 polypeptide domain. Exemplary GL50 polypeptide domains are shown in Figure 12 and include, IgV- 
like, IgC-like, transmembrane, and cytoplasmic domains. 

[0135] The invention further pertains to soluble forms of GL50 polypeptides. Such forms can be naturally occurring 
or can be engineered and can comprise, e.g., an extracellular domain of a GL50 polypeptide. In one embodiment, the 
extracellular domain of a GL50 polypeptide comprises the IgV and IgC domains after cleavage of the signal sequence, 
but not the transmembrane and cytoplasmic domains of a GL50 polypeptide {e.g., corresponding to the amino acid 
sequence from about amino acid 47-279 of SEQ ID NO:2 or about amino acid 22-258 of SEQ ID NO:6). 
[0136] Biologically active portions of a GL50 polypeptide include peptides comprising amino acid sequences sufficiently 
homologous to or derived from the amino acid sequence of the GL50 polypeptide, which include less amino acids than 
the full length GL50 polypeptides, and exhibit at least one activity of a GL50 polypeptide. Typically, biologically active 
portions comprise a domain or motif with at least one activity of the GL50 polypeptide. A biologically active portion of a 
GL50 polypeptide can be a polypeptide which is, for example, at least 10, 25, 50, 100, 150, 200 or more amino acids in 
length. 

[01 37] To determine the percent identity of two amino acid sequences or of two nucleic acid sequences, the sequences 
are aligned for optimal comparison purposes {e.g., gaps can be introduced in one or both of a first and a second amino 
acid or nucleic acid sequence for optimal alignment). In a preferred embodiment, the length of a reference sequence 
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aligned for comparison purposes is at least 30%, preferably at least 40%, more preferably at least 50%, even more 
preferably at least 60%, and even more preferably at least 70%, 80%, or 90% of the length of the reference sequence. 
The residues at corresponding positions are then compared and when a position in one sequence is occupied by the 
same residue as the corresponding position in the other sequence, then the molecules are identical at that position. The 
percent identity between two sequences, therefore, is a function of the number of identical positions shared by two 
sequences (i.e., % identity = # of identical positions/total # of positions x 100). The percent identity between the two 
sequences is a function of the number of identical positions shared by the sequences, taking into account the number 
of gaps, and the length of each gap, which are introduced for optimal alignment of the two sequences. As used herein 
amino acid or nucleic acid "identity" is equivalent to amino acid or nucleic acid "homology". 

[0138] The comparison of sequences and determination of percent identity between two sequences can be accom- 
plished using a mathematical algorithm. A non-limiting example of a mathematical algorithm utilized for comparison of 
sequences is the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264, modified as in Karlin and 
Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873. Such an algorithm is incorporated into the N BLAST and XBLAST 
programs (version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 215:403. BLAST nucleotide searches can be performed 
with the NBLAST program score =100, wordlength=12 to obtain nucleotide sequences homologous to the nucleic acid 
molecules ofthe invention. BLAST protein searches can be performed with the XBLAST program, score=50, wordlength 
=3 to obtain amino acid sequences homologous to the protein molecules of the invention. To obtain gapped alignments 
for comparison purposes, Gapped BLAST can be utilized as described in Altschul et al., (1 997) Nucleic Acids Research 
25(17):3389. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs 
(e.g., XBLAST and NBLAST) can be used. See http ://www. ncbi. n I m . n i h .gov. Another preferred, non-limiting example 
of an algorithm utilized for the comparison of sequences is the algorithm of Myers and Miller, CABIOS (1 989). Such an 
algorithm is incorporated into the ALIGN program (version 2.0 or 2.0U) which is part of the GCG sequence alignment 
software package. When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight residue 
table, a gap length penalty of 12, and a gap penalty of 4 can be used. 

[0139] As another example, the alignment program in Geneworks program (by Oxford Molecular; e.g., version 2.5.1) 
can be used with the parameters set as follows: gap creation = 1 6, extension penalty = 4, scoring matrix = fastadna.cmp, 
and a constant PAM factor. 

[0140] Another non-limiting example of a mathematical algorithm utilized for the alignment of protein sequences is 
the Lipman-Pearson algorithm (Lipman and Pearson (1985) Science 227:1435). When using the Lipman-Pearson al- 
gorithm, a PAM250 weight residue table, a gap length penalty of 12, a gap penalty of 4, and a Kutple of 2 can be used. 
A preferred, non-limiting example of a mathematical algorithm utilized for the alignment of nucleic acid sequences is the 
Wilbur-Lipman algorithm (Wilbur and Lipman (1 983) Proc. Natl. Acad Sci. USA 80:726). When using the Wilbur-Lipman 
algorithm, a window of 20, gap penalty of 3, Ktuple of 3 can be used. Both the Lipman-Pearson algorithm and the Wilbur- 
Lipman algorithm are incorporated, for example, into the MegAlign program (e.g., version 3.1.7) which is part of the 
DNASTAR sequence analysis software package. 

[0141] Additional algorithms for sequence analysis are known in the art, and include ADVANCE and ADAM., described 
in Torelli and Robotti (1994) Comput. Appl. Biosci. 10:3; and FASTA, described in Pearson and Lipman (1988) Proc. 
Natl. Acad. Sci. USA 85:2444. 

[0142] In a preferred embodiment, the percent identity between two amino acid sequences is determined using the 
GAP program in the GCG software package, using either a Blosum 62 matrix or a PAM250 matrix, and a gap weight of 
16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent 
identity between two nucleotide sequences is determined using the GAP program in the GCG software package, using 
a NWSgapdna. CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1 , 2, 3, 4, 5, or 6. 
[0143] Protein alignments can also be made using the Geneworks global protein alignment program (e.g., version 
2.5.1) with the cost to open gap set at 5, the cost to lengthen gap set at 5, the minimum diagonal length set at 4, the 
maximum diagonal offset set at 130, the consensus cutoff set at 50% and utilizing the Pam 250 matrix. 
[0144] The nucleic acid and protein sequences of the present invention can further be used as a "query sequence" 
to perform a search against public databases to, for example, identify other family members or related sequences. Such 
searches can be performed using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al. (1990) J. Mol. 
Biol. 21 5:403-1 0. BLAST nucleotide searches can be performed with the NBLAST program, score = 1 00, wordlength = 
1 2 to obtain nucleotide sequences homologous to GL50 nucleic acid molecules of the invention. BLAST protein searches 
can be performed with the XBLAST program, score = 50, wordlength = 3 to obtain amino acid sequences homologous 
to GL50 polypeptide molecules of the invention. To obtain gapped alignments for comparison purposes, Gapped BLAST 
can be utilized as described in Altschul et al. (1997) Nucleic Acids Res. 25(1 7):3389-3402. When utilizing BLAST and 
Gapped BLAST programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. 
For example, the nucleotide sequences of the invention can be analyzed using the default Blastn matrix 1-3 with gap 
penalties set at: existence 1 1 and extension 1. The amino acid sequences of the invention can be analyzed using the 
default settings: the Blosum62 matrix with gap penalties set at existence 11 and extension 1. See http://www.nc- 
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bi.nlm.nih.gov. 

[0145] The presence of divergent carboxyl regions on RACE clones illustrated by sequence alignments suggest that 
alternate signaling functions may be performed by these distinct molecules by the additional tyrosines in the intracellular 
domain of these molecules. To date, only sporadic studies have been performed to determine whether intracellular 
signaling for either B7-1 or B7-2. On the basis of the presence of cytoplasmic domain tyrosines on GL50 sequences, 
one can predict that such signaling events exist. Inspection of the cytoplasmic domains of mouse and human B7-1 and 
B7-2 show negligible similarity and it has also been suggested that the B7 cytoplasmic domain may be completely 
dispensable, based on the reported ability of B7 molecules to function in gpi-anchored constructs completely lacking 
cytoplasmic sequences. Accordingly, in one embodiment, tyrosine residues in the intracellular domain of a GL50 tyrosine 
molecule can be altered to modulate intracellular signally via a GL50 polypeptide. 

[0146] The invention also provides GL50 chimeric or fusion proteins. As used herein, a GL50 "chimeric protein" or 
"fusion protein" comprises a GL50 polypeptide operatively linked to a non-GL50 polypeptide. An "GL50 polypeptide" 
refers to a polypeptide having an amino acid sequence corresponding to GL50 polypeptide, whereas a "non-GL50 
polypeptide" refers to a polypeptide having an amino acid sequence corresponding to a protein which is not substantially 
homologous to the GL50 polypeptide, e.g., a protein which is different from the GL50 polypeptide and which is derived 
from the same or a different organism. Within a GL50 fusion protein the GL50 polypeptide can correspond to all or a 
portion of a GL50 polypeptide. In a preferred embodiment, a GL50 fusion protein comprises at least one biologically 
active portion of a GL50 polypeptide, e.g., an extracellular domain of a GL50 polypeptide. Within the fusion protein, the 
term "operatively linked" is intended to indicate that the GL50 polypeptide and the non-GL50 polypeptide are fused in- 
frame to each other. The non-GL50 polypeptide can be fused to the N-terminus or C-terminus of the GL50 polypeptide. 
[0147] For example, in one embodiment, the fusion protein is a GST-GL50 member fusion protein in which the GL50 
member sequences are fused to the C-terminus of the GST sequences. In another embodiment, the fusion protein is a 
GL50 member-HA fusion protein in which the GL50 member nucleotide sequence is inserted in a vector such as pCEP4- 
HA vector (Herrscher, R. F. et al. (1995) Genes Dev. 9:3067-3082) such that the GL50 member sequences are fused 
in frame to an influenza hemagglutinin epitope tag. Such fusion proteins can facilitate the purification of a recombinant 
GL50 member or can be used when a molecule that does not bind to an Fc receptor is desired. 

[0148] A GL50 fusion protein can be produced by recombinant expression of a nucleotide sequence encoding a first 
peptide having GL50 activity and a nucleotide sequence encoding second peptide corresponding to a moiety that alters 
the solubility, affinity, stability or valency of the first peptide, for example, an immunoglobulin constant region. Preferably, 
the first peptide consists of a portion of the of a GL50 polypeptide (e.g., a portion of amino acid residues (after cleavage 
of a signal sequence, e.g., corresponding to about amino acids 1-44 of SEQ ID NO:2) of the sequence shown in SEQ 
ID NO:2, 4, or 6 that is sufficient to costimulate activated T cells. The second peptide can include an immunoglobulin 
constant region, for example, a human Cy1 domain or Cy4 domain (e.g., the hinge, CH2 and CH3 regions of human 
IgCyl, or human lgCy4, see e.g., Capon et al. US patent 5,116,964, 5,580,756, 5,844,095 and the like, incorporated 
herein by reference). 

Particularly preferred GL50 Ig fusion proteins include the extracellular domain portion or variable region-like domain of 
a hGL50 coupled to an immunoglobulin constant region. The immunoglobulin constant region may contain genetic 
modifications which reduce or eliminate effector activity inherent in the immunoglobulin structure. For example, DNA 
encoding the extracellular portion of a GL50 polypeptide can be joined to DNA encoding the hinge, CH2 and CH3 regions 
of human IgCyl and/or lgCy4 modified by site directed mutagenesis, e.g., as taught in WO 97/28267. 
[0149] The nucleotide and amino acid sequences of exemplary soluble GL50 and ICOS constructs are presented in 
Figures 26-29. Figure 26 sets forth exemplary human ICOS fusion protein nucleic acid and amino acid sequence, Figure 
27 sets forth an exemplary murine ICOS fusion protein nucleic acid and amino acid sequence, Figure 28 sets forth an 
exemplary human GL50 fusion protein nucleic acid and amino acid sequence, and Figure 29 sets forth an exemplary 
murine GL50 fusion protein nucleic acid and amino acid sequence. 

[0150] A resulting GL50- Ig fusion protein may have altered solubility, binding affinity, stability and/or valency (i.e., the 
number of binding sites available per molecule) and may increase the efficiency of protein purification. Fusion proteins 
and peptides produced by recombinant techniques may be secreted and isolated from a mixture of cells and medium 
containing the protein or peptide. Alternatively, the protein or peptide may be retained cytoplasmically and the cells 
harvested, lysed and the protein isolated. A cell culture typically includes host cells, media and other byproducts. Suitable 
media for cell culture are well known in the art. Protein and peptides can be isolated from cell culture media, host cells, 
or both using techniques known in the art for purifying proteins and peptides. Techniques for transfecting host cells and 
purifying proteins and peptides are known in the art. 

[0151] Preferably, a GL50 fusion protein of the invention is produced by standard recombinant DNA techniques. For 
example, DNA fragments coding for the different polypeptide sequences are ligated together in-frame in accordance 
with conventional techniques, for example employing blunt-ended or stagger-ended termini for ligation, restriction enzyme 
digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment 
to avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can be synthesized by 
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conventional techniques including automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can 
be carried out using anchor primers which give rise to complementary overhangs between two consecutive gene frag- 
ments which can subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for example, 
Current Protocols in Molecular Biology, Ausubel et al. eds. John Wiley & Sons: 1992). Moreover, many expression 
vectors are commercially available that already encode a fusion moiety {e.g., a GST polypeptide or an HA epitope tag). 
A GL50 encoding nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked in- 
frame to the GL50 polypeptide. 

[0152] In another embodiment, the fusion protein is a GL50 polypeptide containing a heterologous signal sequence 
at its N-terminus. In certain host cells (e.g., mammalian host cells), expression and/or secretion of GLSOcan be increased 
through use of a heterologous signal sequence. 

[0153] The GL50 fusion proteins of the invention can be incorporated into pharmaceutical compositions and admin- 
istered to a subject in vivo. Use of GL50 fusion proteins may be useful therapeutically for the treatment of immunological 
disorders, e.g., autoimmune diseases or in the case of transplantation. Moreover, the GL50-fusion proteins of the invention 
can be used as immunogens to produce anti-GL50 antibodies in a subject, to purify GL50 ligands and in screening 
assays to identify molecules which inhibit the interaction of GL50 with a GL50 ligand. 

[01 54] The present invention also pertains to variants of the GL50 polypeptides which function as either GL50 agonists 
(mimetics) or as GL50 antagonists. Variants of the GL50 polypeptides can be generated by mutagenesis, e.g., discrete 
point mutation or truncation of a GL50 polypeptide. An agonist of the GL50 polypeptides can retain substantially the 
same, or a subset, of the biological activities of the naturally occurring form of a GL50 polypeptide. An antagonist of a 
GL50 polypeptide can inhibit one or more of the activities of the naturally occurring form of the GL50 polypeptide by, for 
example, competitively modulating a cellular activity of a GL50 polypeptide. Thus, specific biological effects can be 
elicited by treatment with a variant of limited function. In one embodiment, treatment of a subject with a variant having 
a subset of the biological activities of the naturally occurring form of the protein has fewer side effects in a subject relative 
to treatment with the naturally occurring form of the GL50 polypeptide. 

[0155] In one embodiment, variants of a GL50 polypeptide which function as either GL50 agonists (mimetics) or as 
GL50 antagonists can be identified by screening combinatorial libraries of mutants, e.g., truncation mutants, of a GL50 
polypeptide for GL50 polypeptide agonist or antagonist activity. In one embodiment, a variegated library of GL50 variants 
is generated by combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A 
variegated library of GL50 variants can be produced by, for example, enzymatically ligating a mixture of synthetic 
oligonucleotides into gene sequences such that a degenerate set of potential GL50 sequences is expressible as individual 
polypeptides, or alternatively, as a set of larger fusion proteins (e.g., for phage display) containing the set of GL50 
sequences therein. There are a variety of methods which can be used to produce libraries of potential GL50 variants 
from a degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be performed in 
an automatic DNA synthesizer, and the synthetic gene then ligated into an appropriate expression vector. Use of a 
degenerate set of genes allows for the provision, in one mixture, of all of the sequences encoding the desired set of 
potential GL50 sequences. Methods for synthesizing degenerate oligonucleotides are known in the art (see, e.g., Narang, 
S. A. (1983) Tetrahedron 39:3; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198: 
1 056; Ike et al. (1 983) Nucleic Acids Res. 1 1 :477. 

[0156] In addition, libraries of fragments of a GL50 polypeptide coding sequence can be used to generate a variegated 
population of GL50 fragments for screening and subsequent selection of variants of a GL50 polypeptide. In one embod- 
iment, a library of coding sequence fragments can be generated by treating a double stranded PCR fragment of a GL50 
coding sequence with a nuclease under conditions wherein nicking occurs only about once per molecule, denaturing 
the double stranded DNA, renaturing the DNA to form double stranded DNA which can include sense/antisense pairs 
from different nicked products, removing single stranded portions from reformed duplexes by treatment with S 1 nuclease, 
and ligating the resulting fragment library into an expression vector. By this method, an expression library can be derived 
which encodes N-terminal, C-terminal and internal fragments of various sizes of the GL50 polypeptide. 
[0157] Several techniques are known in the art for screening gene products of combinatorial libraries made by point 
mutations or truncation, and for screening cDNA libraries for gene products having a selected property. Such techniques 
are adaptable for rapid screening of the gene libraries generated by the combinatorial mutagenesis of GL50 polypeptides. 
The most widely used techniques, which are amenable to high through-put analysis, for screening large gene libraries 
typically include cloning the gene library into replicable expression vectors, transforming appropriate cells with the 
resulting library of vectors, and expressing the combinatorial genes under conditions in which detection of a desired 
activity facilitates isolation of the vector encoding the gene whose product was detected. Recursive ensemble mutagen- 
esis (REM), a new technique which enhances the frequency of functional mutants in the libraries, can be used in 
combination with the screening assays to identify GL50 variants (Arkin and Youvan (1992) Proc. Natl. Acad. Sci. USA 
89:7811-7815; Delagrave et al. (1993) Protein Eng. 6(3):327-331 ). 

[0158] In one embodiment, cell based assays can be exploited to analyze a variegated GL50 library. For example, a 
library of expression vectors can be transfected into a cell line which ordinarily synthesizes and secretes GL50. The 
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transfected cells are then cultured such that GL50 and a particular mutant GL50 are secreted and the effect of expression 
of the mutant on GL50 activity in cell supernatants can be detected, e.g., by any of a number of enzymatic assays. 
Plasmid DNA can then be recovered from the cells which score for inhibition, or alternatively, potentiation of GL50 
activity, and the individual clones further characterized. 

[0159] In addition to GL50 polypeptides consisting only of naturally-occurring amino acids, GL50 peptidomimetics are 
also provided. Peptide analogs are commonly used in the pharmaceutical industry as non-peptide drugs with properties 
analogous to those of the template peptide. These types of non-peptide compound are termed "peptide mimetics" or 
"peptidomimetics" (Fauchere, J. (1986) Adv. Drug Res. 15:29; Veber and Freidinger (1985) TINS p. 392; and Evans et 
al. (1987) J. Med. Chem. 30:1229, which are incorporated herein by reference) and are usually developed with the aid 
of computerized molecular modeling. Peptide mimetics that are structurally similar to therapeutically useful peptides can 
be used to produce an equivalent therapeutic or prophylactic effect. Generally, peptidomimetics are structurally similar 
to a paradigm polypeptide (i.e., a polypeptide that has a biological or pharmacological activity), such as human GL50, 
but have one or more peptide linkages optionally replaced by a linkage selected from the group consisting of: -CH2NH-, 
-CH2S-, -CH2-CH2-, -CH=CH- (cis and trans), -COCH2-, -CH(OH)CH2-, and -CH2SO-, by methods known in the art 
and further described in the following references: Spatola, A. F. in "Chemistry and Biochemistry of Amino Acids, Peptides, 
and Proteins" Weinstein, B., ed., Marcel Dekker, New York, p. 267 (1 983); Spatola, A. F., Vega Data (March 1 983), Vol. 
1, Issue 3, "Peptide Backbone Modifications" (general review); Morley, J. S. (1980) Trends Pharm. Sci. pp. 463-468 
(general review); Hudson, D. et al. (1979) Int. J. Pept. Prot. Res. 14:177-185 (-CH2NH-, CH2CH2-); Spatola, A. F. et 
al. (1986) Life Sci. 38:1243-1249 (-CH2-S); Hann, M. M. (1982) J. Chem. Soc. Perkin Trans. I. 307-314 (-CH-CH-, cis 
and trans); Almquist, R. G. et al. (190) J. Med. Chem. 23:1 392-1 398(-COCH2-); Jennings-White, C. et al. (1982) Tet- 
rahedron Lett. 23:2533 (-COCH2-); Szelke, M. et al. European Appln. EP 45665 (1982) CA: 97:39405 (1 982)(-CH(OH) 
CH2-); Holladay, M. W. et al. (1 983) Tetrahedron Lett. (1 983) 24:4401-4404 (-C(OH)CH2-); and Hruby, V. J. (1 982) Life 
Sci. (1 982) 31 :1 89-1 99 (-CH2-S-); each of which is incorporated herein by reference. A particularly preferred non-peptide 
linkage is -CH2NH-. Such peptide mimetics may have significant advantages over polypeptide embodiments, including, 
for example: more economical production, greater chemical stability, enhanced pharmacological properties (half-life, 
absorption, potency, efficacy, etc.), altered specificity {e.g., a broad-spectrum of biological activities), reduced antigenicity, 
and others. Labeling of peptidomimetics usually involves covalent attachment of one or more labels, directly or through 
a spacer (e.g., an amide group), to non-interfering position(s) on the peptidomimetic that are predicted by quantitative 
structure-activity data and/or molecular modeling. Such non-interfering positions generally are positions that do not form 
direct contacts with the macromolecules(s) to which the peptidomimetic binds to produce the therapeutic effect. Deriv- 
itization (e.g., labeling) of peptidomimetics should not substantially interfere with the desired biological or pharmacological 
activity of the peptidomimetic. 

[0160] Systematic substitution of one or more amino acids of a GL50 amino acid sequence with a D-amino acid of 
the same type (e.g., D-lysine in place of L-lysine) can be used to generate more stable peptides. In addition, constrained 
peptides comprising a GL50 amino acid sequence or a substantially identical sequence variation can be generated by 
methods known in the art (Rizo and Gierasch (1992) Annu. Rev. Biochem. 61 :387, incorporated herein by reference); 
for example, by adding internal cysteine residues capable of forming intramolecular disulfide bridges which cyclize the 
peptide. 

[01 61 ] The amino acid sequences of GL50 polypeptides identified herein will enable those of skill in the art to produce 
polypeptides corresponding to GL50 peptide sequences and sequence variants thereof. Such polypeptides can be 
produced in prokaryotic or eukaryotic host cells by expression of polynucleotides encoding a GL50 peptide sequence, 
frequently as part of a larger polypeptide. Alternatively, such peptides can be synthesized by chemical methods. Methods 
for expression of heterologous proteins in recombinant hosts, chemical synthesis of polypeptides, and in vitro translation 
are well known in the art and are described further in Maniatis et al. Molecular Cloning: A Laboratory Manual (1989), 
2nd Ed., Cold Spring Harbor, N.Y.; Berger and Kimmel, Methods in Enzymology, Volume 152, Guide to Molecular Cloning 
Techniques (1987), Academic Press, Inc., San Diego, Calif.; Merrifield, J. (1969) J. Am. Chem. Soc. 91:501; Chaiken 
I. M. (1 981 ) CRC Crit. Rev. Biochem, 1 1 : 255; Kaiser et al. (1 989) Science 243:1 87; Merrifield, B. (1 986) Science 232: 
342; Kent, S. B. H. (1 988) Annu. Rev. Biochem. 57:957; and Offord, R. E. (1 980) Semisynthetic Proteins, Wiley Publishing, 
which are incorporated herein by reference). 

[0162] Peptides can be produced, typically by direct chemical synthesis, and used e.g., as agonists or antagonists of 
a GL50/GL50 ligand interaction. Peptides can be produced as modified peptides, with nonpeptide moieties attached by 
covalent linkage to the N-terminus and/or C-terminus. In certain preferred embodiments, either the carboxy-terminus or 
the amino-terminus, or both, are chemically modified. The most common modifications of the terminal amino and carboxyl 
groups are acetylation and amidation, respectively. Ammo-terminal modifications such as acylation (e.g., acetylation) 
or alkylation (e.g., methylation) and carboxy-terminal-modifications such as amidation, as well as other terminal modi- 
fications, including cyclization, can be incorporated into various embodiments of the invention. Certain amino-terminal 
and/or carboxy-terminal modifications and/or peptide extensions to the core sequence can provide advantageous phys- 
ical, chemical, biochemical, and pharmacological properties, such as: enhanced stability, increased potency and/or 
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efficacy, resistance to serum proteases, desirable pharmacokinetic properties, and others. Peptides can be used ther- 
apeutically to treat disease, e.g., by altering costimulation in a patient. 

[0163] An isolated GL50 polypeptide, or a portion or fragment thereof, can be used as an immunogen to generate 
antibodies that bind GL50 using standard techniques for polyclonal and monoclonal antibody preparation. A full-length 
GL50 polypeptide can be used or, alternatively, the invention provides antigenic peptide fragments of GL50 for use as 
immunogens. The antigenic peptide of GL54 comprises at least 8 amino acid residues and encompasses an epitope of 
GL50 such that an antibody raised against the peptide forms a specific immune complex with GL50. Preferably, the 
antigenic peptide comprises at least 10 amino acid residues, more preferably at least 15 amino acid residues, even 
more preferably at least 20 amino acid residues, and most preferably at least 30 amino acid residues. 
[01 64] Alternatively, an antigenic peptide fragment of a GL50 polypeptide can be used as the immunogen. An antigenic 
peptide fragment of a GL50 polypeptide typically comprises at least 8 amino acid residues of the amino acid sequence 
shown in SEQ ID NO:2, 4, or 6 and encompasses an epitope of a GL50 polypeptide such that an antibody raised against 
the peptide forms an immune complex with a GL50 molecule. Preferred epitopes encompassed by the antigenic peptide 
are regions of GL50 that are located on the surface of the protein, e.g., hydrophilic regions. In one embodiment, an 
antibody binds substantially specifically to a GL50 molecule. In another embodiment, an antibody binds specifically to 
a GL50 polypeptide. 

[0165] Preferably, the antigenic peptide comprises at least about 10 amino acid residues, more preferably at least 
about 15 amino acid residues, even more preferably at least 20 about amino acid residues, and most preferably at least 
about 30 amino acid residues. Preferred epitopes encompassed by the antigenic peptide are regions of a GL50 polypep- 
tide that are located on the surface of the protein, e.g., hydrophilic regions, and that are unique to a GL50 polypeptide. 
In one embodiment such epitopes can be specific for a GL50 polypeptides from one species, such as mouse or human 
(i.e., an antigenic peptide that spans a region of a GL50 polypeptide that is not conserved across species is used as 
immunogen; such non conserved residues can be determined using an alignment such as that provided herein). A 
standard hydrophobicity analysis of the GL50 polypeptide can be performed to identify hydrophilic regions. 
[0166] A GL50 immunogen typically is used to prepare antibodies by immunizing a suitable subject, (e.g., rabbit, goat, 
mouse or other mammal) with the immunogen. An appropriate immunogenic preparation can contain, for example, a 
recombinantly expressed GL50 polypeptide or a chemically synthesized GL50 peptide. The preparation can further 
include an adjuvant, such as Freund's complete or incomplete adjuvant, or similar immunostimulatory agent. Immuni- 
zation of a suitable subject with an immunogenic GL50 preparation induces a polyclonal anti- GL50 antibody response. 
[01 67] Accordingly, another aspect of the invention pertains to anti-GL50 antibodies. Polyclonal anti-GL50 antibodies 
can be prepared as described above by immunizing a suitable subject with a GL50 immunogen. The anti-GL50 antibody 
titer in the immunized subject can be monitored over time by standard techniques, such as with an enzyme linked 
immunosorbent assay (ELISA) using immobilized a GL50 polypeptide. If desired, the antibody molecules directed against 
a GL50 polypeptide can be isolated from the mammal (e.g., from the blood) and further purified by well known techniques, 
such as protein A chromatography to obtain the IgG fraction. At an appropriate time after immunization, e.g., when the 
anti- GL50 antibody titers are highest, antibody-producing cells can be obtained from the subject and used to prepare 
monoclonal antibodies by standard techniques, such as the hybridoma technique originally described by Kohler and 
Milstein (1975) Nature 256:495-497 (see also, Brown et al. (1981) J. Immunol 127:539-46; Brown et al. (1980) J. Biol. 
Chem. 255:4980-83; Yeh et al. (1976) Proc. Natl. Acad. Sci. USA 76:2927-31 ; and Yeh et al. (1982) Int. J. Cancer 29: 
269-75), the more recent human B cell hybridoma technique (Kozbor et al. (1983) Immunol. Today 4:72), the EBV- 
hybridoma technique (Cole et al. (1985) Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96) or 
trioma techniques. The technology for producing monoclonal antibody hybridomas is well known (see generally Kenneth, 
R. H. in Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum Publishing Corp., New York, New 
York (1980); Lerner, E. A. (1981) Yale J. Biol. Med. 54:387-402; Gefter, M. L. et al. (1977) Somatic Cell Genet., 3: 
231 -36). Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes (typically splenocytes) from a mammal 
immunized with a GL50 immunogen as described above, and the culture supernatants of the resulting hybridoma cells 
are screened to identify a hybridoma producing a monoclonal antibody that binds specifically to a GL50 polypeptide. 
[0168] Any of the many well known protocols used for fusing lymphocytes and immortalized cell lines can be applied 
for the purpose of generating an anti-GL50 monoclonal antibody (see, e.g., Galfre, G. et al. (1977) Nature 266:55052; 
Gefter et al. (1977) supra; Lerner (1981) supra; Kenneth, Monoclonal Antibodies, supra). Moreover, the ordinary skilled 
worker will appreciate that there are many variations of such methods which also would be useful. Typically, the immortal 
cell line (e.g., a myeloma cell line) is derived from the same mammalian species as the lymphocytes. For example, 
murine hybridomas can be made by fusing lymphocytes from a mouse immunized with an immunogenic preparation of 
the present invention with an immortalized mouse cell line. Preferred immortal cell lines are mouse myeloma cell lines 
that are sensitive to culture medium containing hypoxanthine, aminopterin and thymidine ("HAT medium"). Any of a 
number of myeloma cell lines may be used as a fusion partner according to standard techniques, e.g., the P3-NS1/1- 
Ag4-1, P3-x63-Ag8.653 or Sp2/0-Ag14 myeloma lines. These myeloma lines are available from the American Type 
Culture Collection (ATCC), Rockville, Md. Typically, HAT-sensitive mouse myeloma cells are fused to mouse splenocytes 
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using polyethylene glycol ("PEG"). Hybridoma cells resulting from the fusion are then selected using HAT medium, which 
kills unfused and unproductively fused myeloma cells (unfused splenocytes die after several days because they are not 
transformed). Hybridoma cells producing a monoclonal antibody of the invention are detected by screening the hybridoma 
culture supernatants for antibodies that bind a GL50 molecule, e.g., using a standard ELISA assay. 
[0169] As an alternative to preparing monoclonal antibody-secreting hybridomas, a monoclonal anti-GL50 antibody 
can be identified and isolated by screening a recombinant combinatorial immunoglobulin library {e.g., an antibody phage 
display library) with a GL50 to thereby isolate immunoglobulin library members that bind a GL50 polypeptide. Kits for 
generating and screening phage display libraries are commercially available (e.g., the Pharmacia Recombinant Phage 
Antibody System, Catalog No. 27-9400-01; and the Stratagene SurfZAP™ Phage Display Kit, Catalog No. 240612). 
Additionally, examples of methods and reagents particularly amenable for use in generating and screening antibody 
display library can be found in, for example, Ladner etal. U.S. Patent No. 5,223,409; Kang etai. International Publication 
No. WO 92/18619; Dower etai. International Publication No. WO 91/17271; Winter etai. International Publication WO 
92/20791; Markland et ai. International Publication No. WO 92/15679; Breitling et ai. International Publication WO 
93/01288; McCafferty etai. International Publication No. WO 92/01047; Garrard etai. International Publication No. WO 
92/09690; Ladner etai. International Publication No. WO 90/02809; Fuchs et al. (1 991 ) Biotechnology (NY) 9:1 369-1 372; 
Hay et al. (1992) Hum. Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275-1281; Griffiths et al. (1993) 
EMBO J. 12:725-734; Hawkins et al. (1992) J. Mol. Biol. 226:889-896; Clarkson etal. (1991) Nature 352:624-628; Gram 
et al. (1992) Proc. Natl. Acad Sci. USA 89:3576-3580; Garrard et al. (1991) Biotechnology (NY) 9:1373-1377; Hoogen- 
boom et al. (1991) Nucleic Acids Res. 19:4133-4137; Barbas et al. (1991) Proc. Natl. Acad. Sci. USA 88:7978-7982; 
and McCafferty et al. (1990) Nature 348:552-554. 

[0170] Additionally, recombinant anti- GL50 antibodies, such as chimeric and humanized monoclonal antibodies, 
comprising both human and non-human portions, which can be made using standard recombinant DNA techniques, are 
within the scope of the invention. Such chimeric and humanized monoclonal antibodies can be produced by recombinant 
DNA techniques known in the art, for example using methods described in Robinson etai. International Patent Publication 
PCT/US86/02269; Akiraefa/. European Patent Application 1 84,1 87; Taniguchi, M. European Patent Application 171,496; 
Morrison etai. European Patent Application 173,494; Neuberger etai. PCT Application WO 86/01533; Cabilly etai. U.S. 
Patent No. 4,81 6,567; Cabilly et ai. European Patent Application 1 25,023; Better et al. (1 988) Science 240:1 041 -1 043; 
Liu et al. (1 987) Proc. Natl. Acad. Sci. USA 84:3439-3443; Liu et al. (1 987) J. Immunol. 1 39:3521 -3526; Sun et al. (1 987) 
Proc. Natl. Acad. Sci. 84:214-218; Nishimura et al. (1987) Cancer Res. 47:999-1005; Wood et al. (1985) Nature 314: 
446-449; and Shaw et al. (1988) J. Natl. Cancer Inst. 80:1553-1559); Morrison, S. L. (1985) Science 229:1202-1207; 
Oi et al. (1 986) Biotechniques 4:21 4; Winter U.S. Patent 5,225,539; Jones et al. (1 986) Nature 321 :552-525; Verhoeyan 
et al. (1 988) Science 239:1 534; and Beidler et al. (1 988) J. Immunol. 1 41 :4053-4060. 

[0171] In addition, humanized antibodies can be made according to standard protocols such as those disclosed in US 
patent 5,565,332. In another embodiment, antibody chains or specif binding pair members can be produced by recom- 
bination between vectors comprising nucleic acid molecules encoding a fusion of a polypeptide chain of a specific binding 
pair member and a component of a replicable geneic display package and vectors containing nucleic acid molecules 
encoding a second polypeptide chain of asingle binding pair memberusing techniques known in the art, e.g., as described 
in US patents 5,565,332, 5,871 ,907, or 5,733,743. The use of intracellular antibodies to inhibit protein function in a cell 
is also known in the art (see e.g., Carlson, J. R. (1988) Mol. Cell. Biol. 8:2638-2646; Biocca, S. et al, (1990) EMBO J. 
9:101-108; Werge, T. M. et al. (1990) FEBS Lett. 274:193-198; Carlson, J. R. (1993) Proc. Natl. Acad Sci. USA 90: 
7427-7428; Marasco, W. A. et al. (1 993) Proc. Natl. Acad Sci. USA 90:7889-7893; Biocca, S. et al. (1 994) Biotechnology 
(NY) 1 2:396-399; Chen, S-Y. et al. (1 994) Hum. Gene Ther. 5:595-601 ; Duan, L et al. (1 994) Proc. Natl. Acad. Sci. USA 
91:5075-5079; Chen, S-Y. et al. (1994) Proc. Natl. Acad. Sci. USA 91:5932-5936; Beerli, R. R. et al. (1994) J. Biol. 
Chem. 269:23931-23936; Beerli, R. R. et al. (1994) Biochem. Biophys. Res. Commun. 204:666-672; Mhashilkar, A. M. 
et al. (1995) EMBO J. 14:1542-1551; Richardson, J. H. et al. (1995) Proc. Natl. Acad. Sci. USA 92:3137-3141; PCT 
Publication No. WO 94102610 by Marasco ef a/.;and PCT Publication No. WO 95/03832 by Duan etai.). 
[0172] In one embodiment, an antibody for use in the instant invention is a bispecific antibody. A bispecific antibody 
has binding sites for two different antigens within a single antibody molecule. Antigen binding may be simultaneous or 
sequential. Triomas and hybrid hybridomas are two examples of cell lines that can secrete bispecific antibodies. Examples 
of bispecific antibodies produced by a hybrid hybridoma or a trioma are disclosed in U.S. Pat. 4,474,893. Bispecific 
antibodies have been constructed by chemical means (Staerz et al. (1985) Nature 314:628, and Perez et al. (1985) 
Nature 316:354) and hybridoma technology (Staerz and Bevan (1986) Proc. Natl. Acad. Sci. USA, 83:1453, and Staerz 
and Bevan (1 986) Immunol. Today 7:241 ). Bispecific antibodies are also described in U.S. patent 5,959,084. Fragments 
of bispecific antibodies are described in US patent 5,798,229. 

[0173] Bispecific agents can also be generated by making heterohybridomas by fusing hybridomas or other cells 
making different antibodies, followed by identification of clones producing and co-assembling both antibodies. They can 
also be generated by chemical or genetic conjugation of complete immunoglobulin chains or portions thereof such as 
Fab and Fv sequences. For example, bispecific agents that bind to the T cell receptor complex, the B cell receptor 
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complex, CD40, CD40 ligand, CD2, or CD45 (in addition ot GL50 or ICOS) can be developed. 

[0174] An anti- GL50 antibody (e.g., monoclonal antibody) can be used to isolate a GL50 polypeptide by standard 
techniques, such as affinity chromatography or immunoprecipitation. Anti-GL50 antibodies can facilitate the purification 
of natural GL50 polypeptides from cells and of recombinantly produced GL50 polypeptides expressed in host cells. 
Moreover, an anti- GL50 antibody can be used to detect a GL50 polypeptide {e.g., in a cellular lysate or cell supernatant). 
In addition, antibodies to GL50 can be used to block the interaction between GL50 and a ligand or binding partner. 
Detection can be facilitated by coupling (i.e., physically linking) the antibody to a detectable substance. Accordingly, in 
one embodiment, an anti-GL50 antibody of the invention is labeled with a detectable substance. Examples of detectable 
substances include various enzymes, prosthetic groups, fluorescent materials, luminescent materials and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, (3-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; ex- 
amples of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlo- 
rotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes luminol; 
and examples of suitable radioactive material include 125 l, 131 1, 35 S, and 3 H. 

[01 75] Yet another aspect of the invention pertains to anti-GL50 antibodies that are obtainable by a process comprising : 

(a) immunizing an animal with an immunogenic GL50 polypeptide, or an immunogenic portion thereof unique to a 
GL50 polypeptide; and 

(b) isolating from the animal antibodies that specifically bind to a GL50 polypeptide. 
IV. Recombinant Expression Vectors and Host Cells 

[0176] Another aspect of the invention pertains to vectors, preferably expression vectors, containing a nucleic acid 
encoding a GL50 family protein (or a portion thereof). As used herein, the term "vector" refers to a nucleic acid molecule 
capable of transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid", which refers 
to a circular double stranded DNA loop into which additional DNA segments can be ligated. Another type of vector is a 
viral vector, wherein additional DNA segments can be ligated into the viral genome. Certain vectors are capable of 
autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated 
into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. 
Moreover, certain vectors are capable of directing the expression of genes to which they are operatively linked. Such 
vectors are referred to herein as "expression vectors". In general, expression vectors of utility in recombinant DNA 
techniques are often in the form of plasmids. In the present specification, "plasmid" and "vector" can be used inter- 
changeably as the plasmid is the most commonly used form of vector. However, the invention is intended to include 
such other forms of expression vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and 
adeno-associated viruses), which serve equivalent functions. 

[01 77] The recombinant expression vectors of the invention comprise a nucleic acid of the invention in a form suitable 
for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or 
more regulatory sequences, selected on the basis of the host cells to be used for expression, which is operatively linked 
to the nucleic acid sequence to be expressed. Within a recombinant expression vector, "operably linked" is intended to 
mean that the nucleotide sequence of interest is linked to the regulatory sequence(s) in a manner which allows for 
expression of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host cell when the 
vector is introduced into the host cell). The term "regulatory sequence" is intended to includes promoters, enhancers 
and other expression control elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel (1990) Methods Enzymol. 185:3-7. Regulatory sequences include those which direct constitutive 
expression of a nucleotide sequence in many types of host cells and those which direct expression of the nucleotide 
sequence only in certain host cells (e.g., tissue-specific regulatory sequences). It will be appreciated by those skilled in 
the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed, 
the level of expression of protein desired, and the like. The expression vectors of the invention can be introduced into 
host cells to thereby produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., GL50 family proteins, mutant forms of GL50 polypeptides or portions thereof, fusion proteins, 
and the like). 

[0178] In one embodiment of the invention, vectors comprising only a transmembrane or intracellular domain of a 
GL50 molecule can be engineered. Such constructs can be used to modulate intracellular signaling via GL50 molecules, 
e.g., and act as dominant negative mutants. 

[0179] The recombinant expression vectors of the invention can be designed for expression of GL50 polypeptides in 
prokaryotic or eukaryotic cells. For example, GL50 polypeptides can be expressed in bacterial cells such as E. coli, 
insect cells (using baculovirus expression vectors) yeast cells or mammalian cells. Suitable host cells are discussed 
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further in Goeddel (1990) supra. Alternatively, the recombinant expression vector can be transcribed and translated in 
vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 

[0180] Expression of proteins in prokaryotes is most often carried out in E. coli with vectors containing constitutive or 
inducible promoters directing the expression of either fusion or non-fusion proteins. Fusion vectors add a number of 
amino acids to a protein encoded therein, usually to the amino terminus of the recombinant protein. Such fusion vectors 
typically serve three purposes: 1) to increase expression of recombinant protein; 2) to increase the solubility of the 
recombinant protein; and 3) to aid in the purification of the recombinant protein by acting as a ligand in affinity purification. 
Often, in fusion expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion moiety and the 
recombinant protein to enable separation of the recombinant protein from the fusion moiety subsequent to purification 
of the fusion protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and enter- 
okinase. Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D. B. and Johnson, K. S. (1 988) 
Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glu- 
tathione S-transferase (GST), maltose E binding protein, or protein A, respectively, to the target recombinant protein. 
[0181] Purified fusion proteins can be utilized therapeutically, in GL50 activity assays, (e.g., direct assays or competitive 
assays described in detail below), or to generate antibodies specific for GL50 polypeptides, for example. 
[0182] Examples of suitable inducible non-fusion E. coli expression vectors include pTrc (Amann et al. (1988) Gene 
69:301 -31 5) and pET 1 1 d (Studier et al. (1 990) Methods Enzymol. 1 85:60-89). Target gene expression from the pTrc 
vector relies on host RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene expression from 
the pET 1 1d vector relies on transcription from a T7 gn10-lac fusion promoter mediated by a coexpressed viral RNA 
polymerase (T7 gnl). This viral polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident 
prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 promoter. 

[0183] One strategy to maximize recombinant polypeptide expression in E. coli \s to express the polypeptide in a host 
bacteria with an impaired capacity to proteolytically cleave the recombinant polypeptide (Gottesman, S. (1 990) Methods 
Enzymol. 1 85:1 1 9-1 28). Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 
expression vector so that the individual codons for each amino acid are those preferentially utilized in E, coli (Wada et 
al. (1 992) Nucleic Acids Res. 20:21 11-211 8). Such alteration of nucleic acid sequences of the invention can be carried 
out by standard DNA synthesis techniques. 

[0184] In another embodiment, the GL50 expression vector is a yeast expression vector. Examples of vectors for 
expression in yeast S. cerivisae include pYepSed (Baldari et al. (1987) EMBO J. 6:229-234), pMFa (Kurjan and Her- 
skowitz (1 982) Cell 30:933-943), pJRY88 (Schultz et al. (1 987) Gene 54:1 1 3-123), pYES2 (Invitrogen Corporation, San 
Diego, CA), and picZ (Invitrogen Corp, San Diego, CA). 

[0185] Alternatively, a GL50 polypeptide can be expressed in insect cells using baculovirus expression vectors. Bac- 
ulovirus vectors available for expression of polypeptides in cultured insect cells (e.g., Sf9 cells) include the pAc series 
(Smith etal. (1983) Mol. Cell Biol. 3:2156-2165) and the pVL series (Lucklow, V. A., and Summers, M. D. (1 989) Virology 
170:31-39). 

[0186] In yet another embodiment, a nucleic acid molecule of the invention is expressed in mammalian cells using a 
mammalian expression vector. Examples of mammalian expression vectors include pMex-Neol, pCDM8 (Seed, B. (1 987) 
Nature 329:840) and pMT2PC (Kaufman et al. (1987) EMBO J. 6:187-195). When used in mammalian cells, the expres- 
sion vector's control functions are often provided by viral regulatory elements. For example, commonly used promoters 
are derived from polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 40. For other suitable expression systems 
for both prokaryotic and eukaryotic cells see chapters 1 6 and 1 7 of Sambrook, J. et al. Molecular Cloning: A Laboratory 
Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 
[0187] In another embodiment, the recombinant mammalian expression vector is capable of directing expression of 
the nucleic acid preferentially in a particular cell type (e.g., tissue-specific regulatory elements are used to express the 
nucleic acid). Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific 
promoters include the albumin promoter (liver-specific; Pinkert et al. (1987) Genes Dev. 1:268-277), lymphoid-specific 
promoters (Calame and Eaton (1988) Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto 
and Baltimore (1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 33:729-740; Queen and 
Baltimore (1983) Cell 33:741-748), neuron-specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle 
(1989) Proc. Natl. Acad Sci. USA 86:5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 230: 
91 2-91 6), and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 4,873,31 6 and European 
Application Publication No. 264,166). Developmentally-regulated promoters are also encompassed, for example the 
murine hox promoters (Kessel and Gruss (1990) Science 249:374-379) and the a-fetoprotein promoter (Campes and 
Tilghman (1989) Genes Dev. 3:537-546). 

[0188] Moreover, inducible regulatory systems for use in mammalian cells are known in the art, for example systems 
in which gene expression is regulated by heavy metal ions (see e.g., Mayo et al. (1982) Cell 29:99-108; Brinster et al. 
(1982) Nature 296:39-42; Searle et al. (1985) Mol. Cell. Biol. 5:1480-1489), heat shock (see e.g., Nouer et al. (1991) in 
Heat Shock Response, Nouer, L., ed. CRC, Boca Raton , FL, pp1 67-220), hormones (see e.g., Lee et al. (1 981 ) Nature 
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294:228-232; Hynes et al. (1981) Proc. Natl. Acad. Sci. USA 78:2038-2042; Klock et al. (1987) Nature 329:734-736; 
Israel and Kaufinan (1989) Nucleic Acids Res. 17:2589-2604; and PCT Publication No. WO 93/23431), FK506-related 
molecules (see e.g., PCT Publication No. WO 94/18317) or tetracyclines (Gossen, M. and Bujard, H. (1992) Proc. Natl. 
Acad. Sci. USA 89:5547-5551 ; Gossen, M. et al. (1995) Science 268:1766-1769; PCT Publication No. WO 94/29442; 
and PCT Publication No. WO 96/01313). Accordingly, in another embodiment, the invention provides a recombinant 
expression vector in which a GL50 DNA is operatively linked to an inducible eukaryotic promoter, thereby allowing for 
inducible expression of a GL50 polypeptide in eukaryotic cells. 

[0189] The invention further provides a recombinant expression vector comprising a DNA molecule of the invention 
cloned into the expression vector in an antisense orientation. That is, the DNA molecule is operatively linked to a 
regulatory sequence in a manner which allows for expression (by transcription of the DNA molecule) of an RNA molecule 
which is antisense to GL50 mRNA. Regulatory sequences operatively linked to a nucleic acid molecule cloned in the 
antisense orientation can be chosen which direct the continuous expression of the antisense RNA molecule in a variety 
of cell types, for instance viral promoters and/or enhancers, or regulatory sequences can be chosen which direct con- 
stitutive, tissue specific or cell type specific expression of antisense RNA. The antisense expression vector can be in 
the form of a recombinant plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under 
the control of a high efficiency regulatory region, the activity of which can be determined by the cell type into which the 
vector is introduced. For a discussion of the regulation of gene expression using antisense genes see Weintraub, H. et 
al., Antisense RNA as a molecular tool for genetic analysis, Reviews - Trends in Genetics, Vol. 1(1) 1986. 
[0190] Another aspect of the invention pertains to host cells into which a recombinant expression vector of the invention 
has been introduced. The terms "host cell" and "recombinant host cell" are used interchangeably herein. It is understood 
that such terms refer not only to the particular subject cell but to the progeny or potential progeny of such a cell. Because 
certain modifications may occur in succeeding generations due to either mutation or environmental influences, such 
progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein. 
[0191] A host cell can be any prokaryotic or eukaryotic cell. For example, a GL50 polypeptide can be expressed in 
bacterial cells such as E. cofi, insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or 
COS cells). Other suitable host cells are known to those skilled in the art. 

[01 92] Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection 
techniques. As used herein, the terms "transformation" and "transfection" are intended to refer to a variety of art-recog- 
nized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium 
chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, orelectroporation. Suitable methods for trans- 
forming or transfecting host cells can be found in Sambrook, et al. (Molecular Cloning: A Laboratory Manual. 2nd, ed., 
Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989), and other labo- 
ratory manuals. 

[0193] For stable transfection of mammalian cells, it is known that, depending upon the expression vector and trans- 
fection technique used, only a small fraction of cells may integrate the foreign DNA into their genome. In order to identify 
and select these integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is generally intro- 
duced into the host cells along with the gene of interest. Preferred selectable markers include those which confer 
resistance to drugs, such as G418, hygromycin and methotrexate. Nucleic acid encoding a selectable marker can be 
introduced into a host cell on the same vector as that encoding a GL50 polypeptide or can be introduced on a separate 
vector. Cells stably transfected with the introduced nucleic acid can be identified by drug selection (e.g., cells that have 
incorporated the selectable marker gene will survive, while the other cells die). 

[0194] A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can be used to produce 
(i.e., express) a GL50 polypeptide. Accordingly, the invention further provides methods for producing a GL50 polypeptide 
using the host cells of the invention. In one embodiment, the method comprises culturing the host cell of invention (into 
which a recombinant expression vector encoding a GL50 polypeptide has been introduced) in a suitable medium such 
that a GL50 polypeptide is produced. In another embodiment, the method further comprises isolating a GL50 polypeptide 
from the medium or the host cell. 

[0195] Certain host cells of the invention can also be used to produce non-human transgenic animals. For example, 
in one embodiment, a host cell of the invention is a fertilized oocyte or an embryonic stem cell into which GL50-coding 
sequences have been introduced. Such host cells can then be used to create non-human transgenic animals in which 
exogenous GL50 sequences have been introduced into their genome or homologous recombinant animals in which 
endogenous GL50 sequences have been altered. Such animals are useful for studying the function and/or activity of a 
GL50 polypeptide and for identifying and/or evaluating modulators of GL50 activity. As used herein, a "transgenic animal" 
is a non-human animal, preferably a mammal, more preferably a rodent such as a rat or mouse, in which one or more 
of the cells of the animal includes a transgene. Other examples of transgenic animals include non-human primates, 
sheep, dogs, cows, goats, chickens, amphibians, and the like. A transgene is exogenous DNA which is integrated into 
the genome of a cell from which a transgenic animal develops and which remains in the genome of the mature animal, 
thereby directing the expression of an encoded gene product in one or more cell types or tissues of the transgenic 
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animal. As used herein, a "homologous recombinant animal" is a non-human animal, preferably a mammal, more pref- 
erably a mouse, in which an endogenous GL50 gene has been altered by homologous recombination between the 
endogenous gene and an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic cell of the 
animal, prior to development of the animal. 

[01 96] A transgenic animal of the invention can be created by introducing a GL50-encoding nucleic acid into the male 
pronucleus of a fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing the oocyte to develop in a 
pseudopregnant female foster animal. The mGL50-1 sequence of SEQ ID NO:1 ,3, or 5 can be introduced as a transgene 
into the genome of a non-human animal. Alternatively, a nonhuman homologue of a hGL50 gene, such as a mouse or 
rat GL50 gene, can be used as a transgene. Alternatively, a GL50 gene homologue, such as another GL50 family 
member, can be isolated based on hybridization to the GL50 family cDNA sequences of SEQ ID NO:1 , 3, or 5 (described 
further in subsection I above) and used as a transgene. Intronic sequences and polyadenylation signals can also be 
included in the transgene to increase the efficiency of expression of the transgene. A tissue-specific regulatory sequence 
(s) can be operably linked to a GL50 transgene to direct expression of a GL50 polypeptide to particular cells. Methods 
for generating transgenic animals via embryo manipulation and microinjection, particularly animals such as mice, have 
become conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both by 
Leder et al., U.S. Patent No. 4,873,191. by Wagner et al. and in Hogan, B., Manipulating the Mouse Embryo, (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Similar methods are used for production of other 
transgenic animals. A transgenic founder animal can be identified based upon the presence of a GL50 transgene in its 
genome and/or expression of GL50 mRNA in tissues or cells of the animals. A transgenic founder animal can then be 
used to breed additional animals carrying the transgene. Moreover, transgenic animals carrying a transgene encoding 
a GL50 polypeptide can further be bred to other transgenic animals carrying other transgenes. 

[0197] To create a homologous recombinant animal, a vector is prepared which contains at least a portion of a GL50 
gene into which a deletion, addition or substitution has been introduced to thereby alter, e.g., functionally disrupt, the 
GL50 gene. The GL50 gene can be a human gene (e.g., the SEQ ID NO:1 , 3, or 5), but more preferably, is a non-human 
homologue of a hGL50 gene (e.g., a cDNA isolated by stringent hybridization with the nucleotide sequence of SEQ ID 
NO: 1 , 3, or 5). For example, a mouse GL50 gene can be used to construct a homologous recombination vector suitable 
for altering an endogenous GL50 gene in the mouse genome. In a preferred embodiment, the vector is designed such 
that, upon homologous recombination, the endogenous GL50 gene is functionally disrupted (i.e., no longer encodes a 
functional protein; also referred to as a "knock out" vector). Alternatively, the vector can be designed such that, upon 
homologous recombination, the endogenous GL50 gene is mutated or otherwise altered but still encodes a functional 
protein (e.g., the upstream regulatory region can be altered to thereby alter the expression of the endogenous GL50 
polypeptide). In the homologous recombination vector, the altered portion of the GL50 gene is flanked at its 5' and 3' 
ends by additional nucleic acid sequence of the GL50 gene to allow for homologous recombination to occur between 
the exogenous GL50 gene carried by the vector and an endogenous GL50 gene in an embryonic stem cell. The additional 
flanking GL50 nucleic acid sequence is of sufficient length for successful homologous recombination with the endogenous 
gene. Typically, several kilobases of flanking DNA (both at the 5' and 3' ends) are included in the vector (see e.g., 
Thomas, K.R. and Capecchi, M. R. (1987) Cell 51:503 for a description of homologous recombination vectors). The 
vector is introduced into an embryonic stem cell line (e.g., by electroporation) and cells in which the introduced GL50 
gene has homologously recombined with the endogenous GL50 gene are selected (see, e.g., Li, E. et al. (1992) Cell 
69:91 5). The selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to form aggregation chimeras 
(see e.g., Bradley, A. in Teratocarci nomas and Embryonic Stem Cells: A Practical Approach, E.J. Robertson, ed. (IRL, 
Oxford, 1987) pp. 113-152). A chimeric embryo can then be implanted into a suitable pseudopregnant female foster 
animal and the embryo brought to term. Progeny harboring the homologously recombined DNA in their germ cells can 
be used to breed animals in which all cells of the animal contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing homologous recombination vectors and homologous recom- 
binant animals are described further in Bradley, A. (1991) Current Opinion in Biotechnology 2:823-829 and in PCT 
International Publication Nos.: WO 90/11354 by Le Mouellec et al.; WO 91/01140 by Smithies et al.; WO 92/0968 by 
Zijlstra et al.; and WO 93/041 69 by Berns et al. 

[0198] In addition to the foregoing, the skilled artisan will appreciate that other approaches known in the art for ho- 
mologous recombination can be applied to the instant invention. Enzyme-assisted site-specific integration systems are 
known in the art and can be applied to integrate a DNA molecule at a predetermined location in a second target DNA 
molecule. Examples of such enzyme-assisted integration systems include the Cre recombinase-lox target system (e.g., 
as described in Baubonis, W. and Sauer, B. (1993) Nucl. Acids Res. 21:2025-2029; and Fukushige, S. and Sauer, B. 
(1992) Proc. Natl. Acad. Sci. USA 89:7905-7909) and the FLP recombinase-FRT target system (e.g., as described in 
Dang, D. T. and Perrimon, N. (1992) Dev. Genet. 13:367-375; and Fiering, S. et al. (1993) Proc. Natl. Acad. Sci. USA 
90:8469-8473). Tetracycline-regulated inducible homologous recombination systems, such as described in PCT Publi- 
cation No. WO 94/29442 and PCT Publication No. WO 96/01313, also can be used. 

[01 99] For example, in another embodiment, transgenic non-humans animals can be produced which contain selected 
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systems which allow for regulated expression of the transgene. One example of such a system is the cre/loxP recombinase 
system of bacteriophage P1 . For a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc. 
Natl. Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the FLP recombinase system of 
Saccharomyces cerevisiae (O'Gorman et al. (1 991 ) Science 251 :1 351 -1 355. If a cre/loxP recombinase system is used 
5 to regulate expression of the transgene, animals containing transgenes encoding both the Cre recombinase and a 
selected protein are required. Such animals can be provided through the construction of "double" transgenic animals, 
e.g., by mating two transgenic animals, one containing a transgene encoding a selected protein and the other containing 
a transgene encoding a recombinase. 

[0200] Clones of the non-human transgenic animals described herein can also be produced according to the methods 
10 described in Wilmut, I. et al. (1997) Nature 385:81 0-813 and PCT International Publication Nos. WO 97/07668 and WO 
97/07669. In brief, a cell, e.g., a somatic cell, from the transgenic animal can be isolated and induced to exit the growth 
cycle and enter Go phase. The quiescentcell can then be fused, e.g., through the use of electrical pulses, to an enucleated 
oocyte from an animal of the same species from which the quiescent cell is isolated. The reconstructed oocyte is then 
cultured such that it develops to morula or blastocyte and then transferred to pseudopregnant female foster animal. The 
15 offspring borne of this female foster animal will be a clone of the animal from which the cell, e.g., the somatic cell, is isolated. 

V. Pharmaceutical Compositions 

[0201] GL50 modulators ("active compounds") of the invention (e.g., GL50 inhibitory or stimulatory agents, including 
20 GL50 nucleic acid molecules, polypeptides, antibodies, or compounds identified as modulators of a GL50 activity) can 
be incorporated into pharmaceutical compositions suitable for administration. Such compositions typically comprise the 
nucleic acid molecule, polypeptide, or antibody and a pharmaceutically acceptable carrier. As used herein the language 
"pharmaceutically acceptable carrier" is intended to include any and all solvents, dispersion media, coatings, antibacterial 
and antifungal agents, isotonic and absorption delaying agents, and the like, compatible with pharmaceutical adminis- 
25 tration. The use of such media and agents for pharmaceutically active substances is well known in the art. Except insofar 
as any conventional media or agent is incompatible with the active compound, use thereof in the compositions is con- 
templated. Supplementary active compounds can also be incorporated into the compositions. 

[0202] A pharmaceutical composition of the invention is formulated to be compatible with its intended route of admin- 
istration. Examples of routes of administration include parenteral, e.g.; intravenous, intradermal, subcutaneous, oral 

30 (e.g., inhalation), transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions used for 
parenteral, intradermal, or subcutaneous application can include the following components: a sterile diluent such as 
water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; 
chelating agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or phosphates and agents 

35 for the adjustment of tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, such as 
hydrochloric acid or sodium hydroxide. The parenteral preparation can be enclosed in ampoules, disposable syringes 
or multiple dose vials made of glass or plastic. 

[0203] Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions (where water soluble) 
or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. For 

^o intravenous administration, suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ (BASF, 
Parsippany, NJ) or phosphate buffered saline (PBS). In all cases, the composition must be sterile and should be fluid 
to the extent that easy syringeability exists. It must be stable under the conditions of manufacture and storage and must 
be preserved against the contaminating action of microorganisms such as bacteria and fungi. The carrier can be a 
solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, 

45 and liquid polyetheylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can be maintained, for 
example, by the use of a coating such as lecithin, by the maintenance of the required particle size in the case of dispersion 
and by the use of surfactants. Prevention of the action of microorganisms can be achieved by various antibacterial and 
antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, 
it will be preferable to include isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium chloride 

50 in the composition. Prolonged absorption of the injectable compositions can be brought about by including in the com- 
position an agent which delays absorption, for example, aluminum monostearate and gelatin. 

[0204] Sterile injectable solutions can be prepared by incorporating the active compound (e.g., a GL50 polypeptide, 
nucleic acid molecule, or anti-GL50 antibody) in the required amount in an appropriate solvent with one or a combination 
of ingredients enumerated above, as required, followed by filtered sterilization. Generally, dispersions are prepared by 
55 incorporating the active compound into a sterile vehicle which contains a basic dispersion medium and the required 
other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable 
solutions, the preferred methods of preparation are vacuum drying and freeze-drying which yields a powder of the active 
ingredient plus any additional desired ingredient from a previously sterile-filtered solution thereof. 
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[0205] Oral compositions generally include an inert diluent or an edible carrier. They can be enclosed in gelatin capsules 
or compressed into tablets. For the purpose of oral therapeutic administration, the active compound can be incorporated 
with excipients and used in the form of tablets, troches, or capsules. Oral compositions can also be prepared using a 
fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is applied orally and swished and 
expectorated or swallowed. Pharmaceutically compatible binding agents, and/or adjuvant materials can be included as 
part of the composition. The tablets, pills, capsules, troches and the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such 
as starch or lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant such as magnesium 
stearate or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; or a 
flavoring agent such as peppermint, methyl salicylate, or orange flavoring. 

[0206] For administration by inhalation, the compounds are delivered in the form of an aerosol spray from pressured 
container or dispenser which contains a suitable propellant, e.g., a gas such as carbon dioxide, or a nebulizer. 
[0207] Systemic administration can also be by transmucosal or transdermal means. For transmucosal or transdermal 
administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are 
generally known in the art, and include, for example, for transmucosal administration, detergents, bile salts, and fusidic 
acid derivatives. Transmucosal administration can be accomplished through the use of nasal sprays or suppositories. 
For transdermal administration, the active compounds are formulated into ointments, salves, gels, or creams as generally 
known in the art. 

[0208] The compounds can also be prepared in the form of suppositories (e.g., with conventional suppository bases 
such as cocoa butter and other glycerides) or retention enemas for rectal delivery. 

[0209] In one embodiment, the active compounds are prepared with carriers that will protect the compound against 
rapid elimination from the body, such as a controlled release formulation, including implants and microencapsulated 
delivery systems. Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, 
polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of such formulations will be 
apparent to those skilled in the art. The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected cells with monoclonal antibodies 
to viral antigens) can also be used as pharmaceutically acceptable carriers. These can be prepared according to methods 
known to those skilled in the art, for example, as described in U.S. Patent No. 4,522,81 1 . 

[0210] It is especially advantageous to formulate oral or parenteral compositions in dosage unit form for ease of 
administration and uniformity of dosage. Dosage unit form as used herein refers to physically discrete units suited as 
unitary dosages for the subject to be treated; each unit containing a predetermined quantity of active compound calculated 
to produce the desired therapeutic effect in association with the required pharmaceutical carrier. The specification for 
the dosage unit forms of the invention are dictated by and directly dependent on the unique characteristics of the active 
compound and the particular therapeutic effect to be achieved, and the limitations inherent in the art of compounding 
such an active compound for the treatment of individuals. 

[021 1] Toxicity and therapeutic efficacy of such compounds can be determined by standard pharmaceutical procedures 
in cell cultures or experimental animals, e.g., for determining the LD50 (the dose lethal to 50% of the population) and 
the ED50 (the dose therapeutically effective in 50% of the population). The dose ratio between toxic and therapeutic 
effects is the therapeutic index and it can be expressed as the ratio LD50/ED50. Compounds which exhibit large ther- 
apeutic indices are preferred. While compounds that exhibit toxic side effects may be used, care should be taken to 
design a delivery system that targets such compounds to the site of affected tissue in order to minimize potential damage 
to uninfected cells and, thereby, reduce side effects. 

[0212] The data obtained from the cell culture assays and animal studies can be used in formulating a dosage range 
for use in humans. The dosage of such compounds lies preferably within a range of circulating concentrations that 
include the ED50 with little or no toxicity. The dosage may vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any compound used in the method of the invention, the therapeu- 
tically effective dose can be estimated initially from cell culture assays. A dose may be formulated in animal models to 
achieve a circulating plasma concentration range that includes the IC50 (i.e., the concentration of the test compound 
which achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information can be used to 
more accurately determine useful doses in humans. Levels in plasma maybe measured, for example, by high performance 
liquid chromatography. 

[0213] The nucleic acid molecules of the invention can be inserted into vectors and used as gene therapy vectors. 
Gene therapy vectors can be delivered to a subject by, for example, intravenous injection, local administration (see U.S. 
Patent 5,328,470) or by stereotactic injection (see e.g., Chen et al. (1994) Proc. Natl. Acad Sci. USA 91 :3054-3057). 
The pharmaceutical preparation of the gene therapy vector can include the gene therapy vector in an acceptable diluent, 
or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the complete 
gene delivery vector can be produced intact from recombinant cells, e.g., retroviral vectors, the pharmaceutical prepa- 
ration can include one or more cells which produce the gene delivery system. 
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[021 4] The pharmaceutical compositions can be included in a container, pack, or dispenser together with instructions 
for administration. 

VI. Uses and Methods of the Invention 

5 

[0215] The nucleic acid molecules, polypeptides, protein homologues, and antibodies described herein can be used 
in one or more of the following methods: a) methods of treatment, e.g., up- or down-modulating the immune response; 
b) screening assays; c) predictive medicine (e.g., diagnostic assays, prognostic assays, monitoring clinical trials, and 
pharmacogenetics). The isolated nucleic acid molecules of the invention can be used, for example, to express GL50 

10 polypeptide (e.g., via a recombinant expression vector in a host cell in gene therapy applications), to detect GL50 mRNA 
(e.g., in a biological sample) or a genetic alteration in a GL50 gene, and to modulate GL50 activity, as described further 
below. The GL50 polypeptides can be used to treat disorders characterized by insufficient or excessive production of 
GL50 inhibitors. In addition, the GL50 polypeptides can be used to screen for naturally occurring GL50 ligands, to screen 
for drugs or compounds which modulate GL50 activity, as well as to treat disorders characterized by insufficient or 

15 excessive production of GL50 polypeptide or production of GL50 polypeptide forms which have decreased or aberrant 
activity compared to GL50 wild type polypeptide. Moreover, the anti-GL50 antibodies of the invention can be used to 
detect and isolate GL50 polypeptides, regulate the bioavailability of GL50 polypeptides, and modulate GL50 activity e.g., 
modulate immune responses. 

20 a. Methods of Treatment: 

[0216] The present invention provides for both prophylactic and therapeutic methods of treating a subject at risk of 
(or susceptible to) a disorder or having a disorder associated with aberrant GL50 expression or activity or a disorder 
that would benefit from modulation of GL50 activity. 

25 

1 . Prophylactic Methods 

[0217] In one aspect, the invention provides a method for preventing in a subject, a disease or condition associated 
with an aberrant GL50 expression or activity, by administering to the subject a GL50 polypeptide or an agent which 

30 modulates GL50 polypeptide expression or at least one GL50 activity. Subjects at risk for a disease which is caused or 
contributed to by aberrant GL50 expression or activity can be identified by, for example, any or a combination of diagnostic 
or prognostic assays as described herein. Administration of a prophylactic agent can occur prior to the manifestation of 
symptoms characteristic of GL50 aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in 
its progression. Depending on the type of GL50 aberrancy or condition, for example, a GL50 polypeptide, GL50 agonist 

35 or GL50 antagonist agent can be used for treating the subject. The appropriate agent can be determined based on 
screening assays described herein. 

2. Therapeutic Methods 

40 [0218] Another aspect of the invention pertains to methods of modulating GL50 expression or activity for therapeutic 
purposes. Accordingly, in an exemplary embodiment, the modulatory method of the invention involves contacting a cell 
with a GL50 polypeptide or agent that modulates one or more of the activities of GL50 polypeptide associated with the 
cell. An agent that modulates GL50 polypeptide activity can be an agent as described herein, such as a nucleic acid or 
a polypeptide, a naturally-occurring target molecule of a GL50 polypeptide (e.g., a GL50 ligand), a GL50 antibody, a 

45 GL50 agonist or antagonist, a peptidomimetic of a GL50 agonist or antagonist, or other small molecule. In one embod- 
iment, the agent stimulates one or more GL50 activities, Examples of such stimulatory agents include agents that 
stimulate the interaction of GL50 with a stimulatory receptor or inhibit the interaction of GL50 with an inhibitory receptor, 
e.g., active GL50 polypeptide, certain soluble forms of GL50 molecules, and a nucleic acid molecule encoding GL50 
polypeptide that has been introduced into the cell. In another embodiment, the agent inhibits one or more GL50 activities. 

50 Examples of such inhibitory agents include agents that diminish the interaction of GL50 and a costimulatory receptor or 
promote the interaction between GL50 and an inhibitory receptor, e.g., antisense GL50 nucleic acid molecules, anti- 
GL50 antibodies, and GL50 inhibitors. These modulatory methods can be performed in vitro (e.g., by culturing the cell 
with the agent) or, alternatively, in vivo (e.g., by administering the agent to a subject). As such, the present invention 
provides methods of treating an individual afflicted with a disease or disorder that would benefit from modulation of a 

55 GL50 polypeptide, e.g., a disorder which would benefit from up- or down-modulation of the immune response, or which. is 
characterized by aberrant expression or activity of a GL50 polypeptide or nucleic acid molecule. In one embodiment, 
the method involves administering an agent (e.g., an agent identified by a screening assay described herein), or com- 
bination of agents that modulates (e.g., upregulates or downregulates) GL50 expression or activity. In another embod- 
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iment, the method involves administering a GL50 polypeptide or nucleic acid molecule as therapy to compensate for 
reduced or aberrant GL50 expression or activity. 

[0219] Stimulation of GL50 activity is desirable in situations in which GL50 is abnormally downregulated and/or in 
which increased GL50 activity is likely to have a beneficial effect. Likewise, inhibition of GL50 activity is desirable in 
5 situations in which GL50 is abnormally upregulated and/or in which decreased GL50 activity is likely to have a beneficial 
effect. 

3. Downregulation of Immune Responses 

10 [0220] It is possible to downregulate the function of a GL50 polypeptide, and thereby downregulate immune responses, 
in a number of ways. Downregulation may be in the form of inhibiting or blocking an immune response already in progress 
or may involve preventing the induction of an immune response. The functions of activated T cells may be inhibited, 
e.g., by suppressing T cell responses or by inducing specific tolerance in T cells, or by leading to the production of 
cytokines that dampen the immune response. Immunosuppression of T cell responses is generally an active, non- 

15 antigen-specific, process which leads to decreased T cell responsiveness and may require continuous exposure of the 
T cells to the suppressive agent. Tolerance, which involves inducing non-responsiveness or anergy in T cells, is distin- 
guishable from immunosuppression in that it is generally antigen-specific and persists after exposure to the tolerizing 
agent has ceased. Operationally, tolerance can be demonstrated by the lack of a T cell response upon reexposure to 
specific antigen where the reexposure occurs in the absence of the tolerizing agent. 

20 [0221 ] For example, GL50 polypeptides, (including nonactivating forms of a GL50 polypeptide) or anti-GL50 antibodies 
that result in the failure to deliver a costimulatory signal to T cells that have received a primary activation signal, can be 
used to block GL50 the interaction between GL50 and its ligand(s) on T cells and thereby provide a specific means by 
which to cause immunosuppression and/or induce tolerance in a subject. Such blocking or inhibitory forms of GL50 
polypeptides and fusion proteins and blocking antibodies can be identified by their ability to inhibit T cell proliferation 

25 and/or cytokine production when added to an in vitro costimulation assay as described herein and known in the art. In 
contrast to the inhibitory forms of a GL50 polypeptide, activating forms (such as an intact cell surface GL50 polypeptide 
and certain soluble forms of GL50) preferably transmit a costimulatory signal to the T cells, resulting in an increased 
secretion of cytokines (e.g., IL-1 0) when compared to activated T cells that have not received a costimulatory signal. 
[0222] I n one embodiment, fusion proteins comprising a GL50 first peptide fused to a second peptide having an activity 

30 of another B lymphocyte antigen (e.g., B7-1 or B7-2) can be used to modify T cell mediated immune responses. Alter- 
natively, two separate peptides having an activity of B lymphocyte antigens, (for example, a GL50 polypeptide plus a 
B7-2 and/or B7-1 polypeptide), or a combination of blocking antibodies (e.g., antibodies against a GL50 polypeptide 
with anti-B7-2 and/or anti-B7-1 monoclonal antibodies) can be combined as a single composition or administered sep- 
arately (simultaneously or sequentially), to upregulate or downregulate T cell mediated immune responses in a subject. 

35 Furthermore, a therapeutically active amount of one or more peptides having a GL50 polypeptide activity, with B7-1 
and/or B7-2 activity can be used in conjunction with other immunomodulating reagents to influence immune responses. 
Examples of other immunomodulating reagents include blocking antibodies, (e.g., against CD28, CTLA4, and/or ICOS, 
or against other T cell markers, or against cytokines), fusion proteins (e.g., CTLA4lg), or immunosuppressive drugs, 
(e.g., rapamycin, cyclosporine A or FK506). 

40 [0223] The peptides produced from the nucleic acid molecules of the present invention may also be useful in the 
construction of therapeutic agents which block T cell function by destruction of the T cell. For example, as described, 
soluble, secreted forms of a GL50 polypeptide or antibodies that bind to a ligand on a T cell can be used. Such secreted 
forms can be constructed by standard genetic engineering techniques. By linking a soluble form of a GL50 polypeptide 
or antibody to a toxin such as ricin, an agent capable of preventing T cell activation can be made. Infusion of one or a 

45 combination of immunotoxins, (e.g., GL50-ricin with B7-2-ricin and/or BT-1 -ricin), into a patient may result in the death 
of T cells, particularly of activated T cells that express higher amounts of CD28, CTLA4, and/or ICOS or GL50. 
[0224] Another method of preventing the function of a GL50 polypeptide is through the use of an antisense or triplex 
oligonucleotide. For example, an oligonucleotide complementary to the area around a GL50 polypeptide translation 
initiation site, can be synthesized. One or more antisense oligonucleotides can be added to cell media, typically at 200 

50 img/ml, or administered to a patient to prevent the synthesis of a GL50 polypeptide. The antisense oligonucleotide is 
taken up by cells and hybridizes to a GL50 mRNA to prevent translation. Alternatively, an oligonucleotide which binds 
double-stranded DNA to form a triplex construct to prevent DNA unwinding and transcription can be used. As a result 
of either, synthesis of a GL50 polypeptide is blocked. 

[0225] Downregulating or preventing one or more GL50 polypeptide functions, e.g., preventing high level lymphokine 
55 synthesis by activated T cells, will be useful in situations of tissue, skin and organ transplantation and in graft-versus- 
host disease (GVHD). For example, blockage of T cell function should result in reduced tissue destruction in tissue 
transplantation. Typically, in tissue transplants, rejection of the transplant is initiated through its recognition as foreign 
by T cells, followed by an immune reaction that destroys the transplant. The administration of a molecule which inhibits 
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or blocks interaction of a B7 lymphocyte antigen with its natural ligand(s) on immune cells (such as a soluble, monomeric 
form of a GL50 polypeptide alone or in conjunction with a monomeric form of a different B7 peptide (e.g., B7-1 , B7-2) 
or blocking antibody), prior to transplantation can lead to the binding of the molecule to the natural ligand(s) on the 
immune cells without transmitting the corresponding costimulatory signal. Blocking B lymphocyte antigen function in this 
manner prevents cytokine synthesis by immune cells, such as T cells and, thus, acts as an immunosuppressant. Moreover, 
the lack of costimulation may also be sufficient to anergize the T cells, thereby inducing tolerance in a subject. Induction 
of long-term tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration 
of these blocking reagents. To achieve sufficient immunosuppression or tolerance in a subject, it may also be necessary 
to block the function of a combination of B lymphocyte antigens. For example, it may be desirable to block the function 
of B7-1 and GL50, B7-2 and GL50, or B7-1 and B7-2 and a GL50 polypetide, by administering a soluble form of a 
combination of peptides having an activity of each of these antigens or blocking antibodies against these antigens 
(separatelyortogether in asingle composition) prior to transplantation. Alternatively, inhibitory formsof GL50 polypeptides 
can be used with other suppressive agents such as blocking antibodies against other T cell markers or against cytokines, 
other fusion proteins, e.g., CTLA4lg, or immunosuppressive drugs. 

[0226] The efficacy of particular blocking reagents in preventing organ transplant rejection or GVHD can be assessed 
using animal models that are predictive of efficacy in humans. Because B7 polypeptides display amino acid conservation 
across species, it is likely that other GL50 antigens can function across species, thereby allowing use of reagents 
composed of human proteins in animal systems. Examples of appropriate systems which can be used include allageneic 
cardiac grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine the 
immunosuppressive effects of CTLA4lg fusion proteins in vivo as described in Lenschow et al., Science, 257: 789-792 
(1 992) and Turka et al., Proc. Natl. Acad. Sci. USA, 89: 11 1 02-1 1 1 05 (1 992). In addition, murine models of GVHD (see 
Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to determine the effect 
of blocking function of a GL50 polypeptide in vivo on the development of that disease. 

[0227] Blocking a GL50 polypeptide function, e.g., by use of a peptide having a GL50 polypeptide activity alone or in 
combination with a peptide having B7-1 activity and/or a peptide having B7-2 activity, may also be therapeutically useful 
for treating autoimmune diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that 
are reactive againstself tissue and which promote the production of cytokines and autoantibodies involved in the pathology 
of the diseases. Preventing the activation of autoreactive T cells may reduce or eliminate disease symptoms. Adminis- 
tration of reagents which block costimulation of T cells by disrupting receptonligand interactions of B lymphocyte antigens 
can be used to inhibit T cell activation and prevent production of autoantibodies or T cell-derived cytokines which may 
be involved in the disease process. Additionally, blocking reagents may induce antigen-specif tolerance of autoreactive 
T cells which could lead to long-term relief from the disease. The efficacy of blocking reagents in preventing or alleviating 
autoimmune disorders can be determined using a number of well-characterized animal models of human autoimmune 
diseases. Examples include murine experimental autoimmune encephalitis, systemic lupus erythematosus in MRUiprl 
/pr mice or NZB hybrid mice, murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB rats, and 
murine experimental myasthenia gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 
840-856). 

[0228] The IgE antibody response in atopic allergy is highly T cell dependent and, thus, inhibition of B lymphocyte 
antigen induced T cell activation may be useful therapeutically in the treatment of allergy and allergic reactions. An 
inhibitory form of a GL50 polypeptide, such as a peptide having a GL50 polypeptide activity alone or in combination with 
another B lymphocyte antigen, such as B7-1 or B7-2, can be administered to an allergic subject to inhibit T cell mediated 
allergic responses in the subject. Inhibition of GL50 costimulation of T cells may be accompanied by exposure to allergen 
in conjunction with appropriate MHC molecules. Allergic reactions may be systemic or local in nature, depending on the 
route of entry of the allergen and the pattern of deposition of IgE on mast cells or basophils. Thus, it may be necessary 
to inhibit T cell mediated allergic responses locally or systemically by proper administration of an inhibitory form of a 
GL50 polypeptide. 

[0229] Inhibition of T cell activation through blockage of aGL50 antigen function may also be important therapeutically 
in viral infections of T cells. For example, in the acquired immune deficiency syndrome (AIDS), viral replication is stimulated 
by T cell activation. Blocking a GL50 function could lead to a lower level of viral replication and thereby ameliorate the 
course of AIDS. In addition, it may also be desirable to block the function of a combination of B lymphocyte antigens 
i.e., GL50 with B7-2 and/or B7-1 . 

[0230] In one embodiment of the invention, a GL50 family member preferentially induces IL-10 secretion by a T cell 
(Hutloff et al. (1999) Nature 397:263). IL-10, while promoting the development of Th2 type responses, also leads to 
downmodulation of the production of certain cytokines, and a downmodulation of cell mediated immunity, e.g., by de- 
creasing macrophage activation (Bai et al. (1 997) Clin. Immunol. Immunopathol. 83:1 1 7; Koch et al. (1996) J. Exp. Med. 
1 84:741 ; de Vries (1 995) Ann. Med. 27:537). Accordingly, in one embodiment of the invention, increasing the activity of 
a GL50 family member can lead to downmodulation of a cell-mediated immune response. Thus, in one embodiment of 
the invention cell-mediated immune responses are decreased by increasing GL50 activity. 
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4. Upregulation of Immune Responses 

[0231] Upregulation of an immune response, e.g., by promoting a stimulatory activity of GL50 may also be useful in 
therapy. Upregulation of immune responses may be in the form of enhancing an existing immune response or eliciting 

5 an initial immune response. For example, enhancing an immune response through stimulating GL50 activity may be 
useful in cases of viral infection. Viral infections are cleared primarily by cytolytic T cells. In accordance with the present 
invention, it is believed that GL50 polypeptide interacting with its natural ligand(s) on T cells may result in an increase 
in the cytolytic activity of at least some T cells. The addition of an activating form of GL50, alone, or in combination with 
an activating form of a different B7 family polypeptide to stimulate T cell activity through the costimulation pathway would 

10 thus be therapeutically useful in situations where more rapid or thorough clearance of virus would be beneficial. These 
would include viral skin diseases such as Herpes or shingles, in which cases the monovalent or multi-valent soluble 
GL50 polypeptide or combination of such peptide with a peptide having B7-1 activity and/or a peptide having B7-2 activity 
is delivered topically to the skin. In addition, systemic viral diseases such as influenza, the common cold, and encephalitis 
might be alleviated by the administration of stimulatory forms of GL50 systemically. 

15 [0232] Alternatively, anti-viral immune responses may be enhanced in an infected patient by removing T cells from 
the patient, costimulating the T cells in vitro with viral antigen-pulsed APCs either expressing a GL50 peptide (alone or 
in combination with a peptide having B7-1 activity and/or a peptide having B7-2 activity) or together with a stimulatory 
form of a soluble GL50 peptide (alone or in combination with a peptide having B7-1 activity and/or a peptide having B7-2 
activity) and reintroducing the in vitro activated T cells into the patient. Another method of enhancing anti-viral immune 

20 responses would be to isolate infected cells from a patient, transfect them with a nucleic acid molecule encoding a 
peptide having the activity of a B lymphocyte antigen as described herein such that the cells express all or a portion of 
a GL50 antigen on their surface, and reintroduce the transfected cells into the patient. The infected cells would now be 
capable of delivering a costimulatory signal to, and thereby activate, T cells in vivo. 

[0233] Stimulatory forms of GL50 molecules may also be used prophylactically in vaccines against various pathogens. 

25 Immunity against a pathogen, e.g., a virus, could be induced by vaccinating with a viral protein along with a stimulatory 
form of a GL50 polypeptide in an appropriate adjuvant. Alternately, an expression vector which encodes genes for both 
a pathogenic antigen and a peptide having the activity of a GL50 antigen, e.g., a vaccinia virus expression vector 
engineered to express a nucleic acid molecule encoding a viral protein and a nucleic acid molecule encoding a GL50 
polypeptide as described herein, can be used for vaccination. DNA vaccines can be administered by a variety of means, 

30 for example, by injection (e.g., intramuscular, intradermal, or the biolistic injection of DNA-coated gold particles into the 
epidermis with a gene gun that uses a particle accelerator or a compressed gas to inject the particles into the skin 
(Haynes et al. (1996) J. Biotechnol. 44:37)). Alternatively, DNA vaccines can be administered by non-invasive means. 
For example, pure or lipid-formulated DNA can be delivered to the respiratory system or targeted elsewhere, e.g., Peyers 
patches by oral delivery of DNA (Schubbert (1 997) Proc. Natl. Acad. Sci. USA 94:961 ). Attenuated microorganisms can 

35 be used for delivery to mucosal surfaces. (Sizemore et al. 1995. Science. 270:29). In one embodiment, antigen is 
administered concurrently with a stimulatory form of a GL50 molecule, 

[0234] In another application, upregulation or enhancement of GL50 function may be useful in the induction of tumor 
immunity. In one embodiment, the GL50 molecule is cell associated. Tumor cells (e.g., sarcoma, melanoma, lymphoma, 
leukemia, neuroblastoma, carcinoma) transfected with a nucleic acid encoding at least one GL50 antigen can be ad- 

^o ministered to a subject to overcome tumor-specific tolerance in the subject. If desired, the tumor cell can be transfected 
to express a combination of B7 polypeptides (e.g., B7-1 , B7-2, GL50). For example, tumor cells obtained from a patient 
can be transfected ex wVowith an expression vector directing the expression of a GL50 polypeptide alone, or in conjunction 
with a peptide having B7-1 activity and/or B7-2 activity. The transfected tumor cells are returned to the patient to result 
in expression of the peptides on the surface of the transfected cell. Alternatively, gene therapy techniques can be used 

45 to target a tumor cell for transfection in vivo. 

[0235] The presence of the peptide having the activity of a GL50 molecule on the surface of the tumor cell provides 
the necessary costimulation signal to T cells to induce a T cell mediated immune response against the transfected tumor 
cells. In addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to express sufficient amounts 
of MHC class I or MHC class II molecules, can be transfected with nucleic acid encoding all or a portion of (e.g., a 

50 cytoplasmic-domain truncated portion) of an MHC class I a chain protein and p 2 microglobulin protein or an MHC class 
II a chain protein and an MHC class II p chain protein to thereby express MHC class I or MHC class II proteins on the 
cell surface. Expression of the appropriate class I or class II MHC in conjunction with a peptide having the activity of a 
B lymphocyte antigen (e.g., B7-1 , B7-2, GL50) induces a T cell mediated immune response against the transfected 
tumor cell. Optionally, a gene encoding an antisense construct which blocks expression of an MHC class 1 1 associated 

55 protein, such as the invariant chain, can also be cotransfected with a DNA encoding a GL50 polypeptide to promote 
presentation of tumor associated antigens and induce tumor specific immunity. Expression of B7-1 by B7 negative murine 
tumor cells has been shown to induce T cell mediated specific immunity accompanied by tumor rejection and prolonged 
protection to tumor challenge in mice (Chen, L. et al. (1992) Cell 71:1093 -1102; Townsend, S. E. and Allison, J. P. 



35 



EP 1 900 816 A2 



(1993) Science 259:368-370; Baskar, S. et al. (1993) Proc. Natl. Acad Sci. USA 90:5687-5690). Thus, the induction of 
a T cell mediated immune response in a human subject may be sufficient to overcome tumor-specific tolerance in the 
subject. 

[0236] In another embodiment, an activating form of one or more GL50 peptides (e.g., expressed on a cell surface) 
5 can be administered to a tumor-bearing patient to provide a costimulatory signal to T cells in order to induce anti-tumor 
immunity using techniques that are known in the art. 

[0237] In a specific embodiment, T cells are obtained from a subject and cultured ex vivo to expand the population of 
T cells. In a further embodiment the T cells are then administered to a subject. T cells can be stimulated to proliferate 
in vitro by, for example, providing to the T cells a primary activation signal and a costimulatory signal, as is known in the 
10 art. Various forms of GL50 polypeptides can also be used to costimulate proliferation of T cells. In one embodiment T 
cells are cultured ex vivo according to the method described in PCT Application No. WO 94/29436. The costimulatory 
molecule can be soluble, attached to a cell membrane or attached to a solid surface, such as a bead. 

B. Identification of Cytokines Induced by GL50 Mediated Costimulation 

15 

[0238] The GL50 molecules as described herein can be used to identify cytokines which are produced by T cells in 
response to stimulation by a GL50 polypeptide. T cells can be suboptimally stimulated in vitro with a primary activation 
signal, such as phorbol ester, anti-CD3 antibody or preferably antigen in association with an MHC class II molecule, and 
given a costimulatory signal by a stimulatory form of GL50 antigen, for instance by a cell transfected with nucleic acid 

20 encoding a GL50 polypeptide and expressing the peptide on its surface or by a soluble, stimulatory form of the peptide. 
Known cytokines released into the media can be identified by ELISA or by the ability of an antibody which blocks the 
cytokine to inhibit T cell proliferation or proliferation of other cell types that is induced by the cytokine. An IL-4 ELISA kit 
is available from Genzyme (Cambridge MA), as is an IL-7 blocking antibody. Blocking antibodies against IL-9 and IL-12 
are available from Genetics Institute (Cambridge, MA). 

25 [0239] An in vitro T cell costimulation assay as described above can also be used in a method for identifying novel 
cytokines which may be induced by costimulation. For example, where stimulation of the CD28/CTLA4 pathway seems 
to enhance IL-2 secretion, stimulation of the ICOS pathway seems to enhance IL-1 0 secretion (Hutloff et al. 1 99. Nature 
397:263). If a particular activity induced upon costimulation, e.g., T cell proliferation, cannot be inhibited by addition of 
blocking antibodies to known cytokines, the activity may result from the action of an unknown cytokine. Following cos- 

30 timulation, this cytokine could be purified from the media by conventional methods and its activity measured by its ability 
to induce T cell proliferation. 

[0240] To identify cytokines which may prevent the induction of tolerance, an in vitro T cell costimulation assay as 
described above can be used. In this case, T cells would be given the primary activation signal and contacted with a 
selected cytokine, but would not be given the costimulatory signal. After washing and resting the T cells, the cells would 

35 be rechallenged with both a primary activation signal and a costimulatory signal. If the T cells do not respond (e.g., 
proliferate or produce cytokines) they have become tolerized and the cytokine has not prevented the induction of tolerance. 
However, if the T cells respond, induction of tolerance has been prevented by the cytokine. Those cytokines which are 
capable of preventing the induction of tolerance can be targeted for blockage in vivo in conjunction with reagents which 
block B lymphocyte antigens as a more efficient means to induce tolerance in transplant recipients or subjects with 

^o autoimmune diseases. For example, one could administer a GL50 blocking reagent together with a cytokine blocking 
antibody to a subject. 

C. Identification of Molecules which Influence Costimulation 

45 [0241] Another application of the peptide having the activity of a novel B lymphocyte antigen of the invention is the 
use of one or more of these peptides in screening assays to discover as yet undefined molecules which are modulators 
of costimulatory ligand binding and/or of intracellular signaling through T cells following costimulation. For example, a 
solid-phase binding assay using a peptide having the activity of a GL50 molecule, could be used to identify molecules 
to which GL50 binds and/or which inhibit binding of the antigen with an appropriate T cell ligand (e.g., CD28, CTLA4, 

50 or ICOS). In addition, an in vitro J cell costimulation assay as described above could be used to identify molecules which 
interfere with intracellular signaling through the T cells following costimulation as determined by the ability of these 
molecules to inhibit T cell proliferation and/or cytokine production (yet which do not prevent binding of a GL50 molecule 
to its ligand). For example, the compound cyclosporine A and rapamycin inhibit T cell activation through stimulation via 
the T cell receptor pathway but not via the CD28/CTLA4 pathway. Therefore, a different intracellular signaling pathway 

55 is involved in costimulation. Molecules which interfere with intracellular signaling via the CD28/CTLA4 and/or ICOS 
pathway may be effective as immunosuppressive agents in vivo with or without the use of an additional immunosup- 
pressant such as cyclosporine A or rapamycin. 
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D. Identification of Molecules which Modulate Expression of a GL50 



Polypeptide 

5 [0242] The antibodies produced using the proteins and peptides of the current invention can be used in a screening 
assay for molecules which modulate the expression of GL50 polypeptide on cells. For example, molecules which effect 
intracellular signaling which leads to induction of expression GL50 polypeptides e.g., in response to activation signals, 
can be identified by assaying expression of one or more GL50 polypeptides on the cell surface. Reduced immunofluo- 
rescent staining by an anti-GL50 antibody in the presence of the molecule would indicate that the molecule inhibits 

10 intracellular signals. Molecules which upregulate GL50 polypeptide expression result in an increased immunofluorescent 
staining. Alternatively, the effect of a molecule on expression of a GL50 polypeptide can be determined by detecting 
cellular GL50 mRNA levels using a probe of the invention. For example, a cell which expresses a GL50 polypeptide can 
be contacted with a molecule to be tested, and an increase or decrease in GL50 mRNA levels in the cell detected by 
standard technique, such as Northern hybridization analysis or conventional dot blot of mRNA or total poly(A + )RNAs 

15 using a mGL50-1 probe labeled with a detectable marker. Molecules which modulate expression of a GL50 polypeptide 
may be useful therapeutically for either upregulating or downregulating immune responses alone or in conjunction with 
soluble blocking orstimulating reagents. For instance, a molecule which inhibits expression of GL50 could be administered 
together with a GL50 blocking reagent for immunosuppressive purposes. Molecules which can be tested in the above- 
described assays include cytokines such as IL-4, ylNF, IL-10, IL-12, GM-CSF and prostagladins. 

20 

E. Screening Assays: 

[0243] The invention provides a method (also referred to herein as a "screening assay") for identifying modulators, 
i.e., candidate or test compounds or agents (e.g., peptides, peptidomimetics, small molecules or other drugs) which 
25 bind to GL50 polypeptides or portions thereof, have a stimulatory or inhibitory effect on, for example, GL50 expression 
or GL50 activity. 

[0244] In one embodiment, the invention provides assays for screening candidate or test compounds which bind to 
or modulate the activity of a GL50 polypeptide or polypeptide or biologically active portion thereof, e.g., modulate the 
ability of GL50 polypeptide to interact with a binding partner (e.g., a cognate ligand or intracellular interactor). For 
30 example, in one embodiment, portions of the extracellular domain of GL50can be used. In another embodiment, portions 
of the cytoplasmic domain of a GL50 molecule can be used. In another embodiment, portions of the transmembrane 
domain of a GL50 molecule can be used. 

[0245] In one embodiment, variant forms of a polypeptide comprising a GL50 domain can be used in a screening 
assay. For example, GL50 domains comprising an amino acid alteration (e.g., that have been mutagenized using, for 

35 example random or cassette mutagenesis) can be used in the subject screening assays. Alternatively, splicing variants 
of GL50 intracellular domains (e.g., GL50-1 intracellular domain, GL50-2 cytoplasmic domain or additional exons iden- 
tified upon sequencing of chromosome 21 or identified by RACE PCR) can be to screen for compounds. Such GL50 
variants can be used to identify compounds with activity against a range of GL50 molecules and can identify amino acid 
residues essential for GL50 activity. 

40 [0246] The test compounds of the present invention can be obtained using any of the numerous approaches in com- 
binatorial library methods known in the art, including: biological libraries; spatially addressable parallel solid phase or 
solution phase libraries; synthetic library methods requiring deconvolution; the 'one-bead one-compound' library method; 
and synthetic library methods using affinity chromatography selection. The biological library approach is limited to peptide 
libraries, while the other four approaches are applicable to peptide, non-peptide oligomer or small molecule libraries of 

45 compounds (Lam, K. S. (1997) Anticancer Drug Des. 12:145). 

[0247] Examples of methods for the synthesis of molecular libraries can be found in the art, for example in: DeWitt et 
al. (1993) Proc. Natl. Acad. Sci. USA 90:6909; Erb et al. (1994) Proc. Natl. Acad. Sci. USA 91 :11422; Zuckermann et 
al. (1 994) J. Med. Chem. 37:2678; Cho et al. (1 993) Science 261 :1 303; Carrell et al. (1 994) Angew. Chem. Int. Ed. Engl. 
33:2059; Carell et al. (1 994) Angew. Chem. Int. Ed. Engl. 33:2061 ; and in Gallop et al. (1 994) J. Med Chem. 37: 1 233. 

50 [0248] Libraries of compounds may be presented in solution (e.g., Houghten (1992) Biotechniques 13:412-421), or 
on beads (Lam (1 991 ) Nature 354:82-84), chips (Fodor (1 993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), 
spores (Ladner USP '409), plasmids (Cull et al. (1992) Proc. Natl. Acad. Sci. USA 89:1865-1869) or on phage (Scott 
and Smith (1990) Science 249:386-390); (Devlin (1990) Science 249:404-406); (Cwirla et al. (1990) Proc. Natl. Acad. 
Sci. 87:6378-6382); (Felici (1991) J. Mol. Biol. 222:301-310); (Ladner supra.). 

55 [0249] In another embodiment, an assay is a cell-based assay comprising contacting a cell expressing a GL50 target 
molecule (e.g., a GL50 ligand such as ICOS or intracellular interactor molecule) with a test compound and determining 
the ability of the test compound to modulate (e.g. stimulate or inhibit) the activity of the GL50 target molecule. In one 
embodiment, a GL50 target molecule is identified, e.g., in a two or three hybrid assay. In another embodiment, a GL50 
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interactor molecule is identified using standard methods for crosslinking GL50 to neighboring molecules followed by 
immunoprecipitation using anti-GL50 antibodies. 

[0250] In one embodiment, portions of the transmembrane and/or intracellular regions as defined by hydropathy plots 
or domains as defined by exon structure can be as bait in 2-hybrid assays to determine binding partners to these domains. 
Interacting proteins can be used in assays to quantitate the degree of GL50 binding to interaction partners potentially 
for production or quality control assays. In another embodiment,cytoplasmic domain splice variants can be used in 
different 2-hybrid assays to collect the entire range of protein interactors that bind to any GL50 splice variant. 
[0251] Determining the ability of the test compound to modulate the activity of a GL50 target molecule can be accom- 
plished, for example, by determining the ability of the GL50 polypeptide to bind to or interact with the GL50 target molecule 
or its ligand. Determining the ability of the GL50 polypeptide to bind to or interact with a ligand of a GL50 molecule can 
be accomplished, e.g., by direct binding. 

[0252] In a direct binding assay, the GL50 polypeptide could be coupled with a radioisotope or enzymatic label such 
that binding of the GL50 polypeptide to a GL50 target molecule can be determined by detecting the labeled GL50 
polypeptide in a complex. For example, GL50 molecules, e.g., GL50 polypeptides, can be labeled with 125 l, 35 S, 14 C, 
or 3 H, either directly or indirectly, and the radioisotope detected by direct counting of radioemmission or by scintillation 
counting. Alternatively, GL50 molecules can be enzymatically labeled with, for example, horseradish peroxidase, alkaline 
phosphatase, or luciferase, and the enzymatic label detected by determination of conversion of an appropriate substrate 
to product. 

[0253] It is also within the scope of this invention to determine the ability of a compound to modulate the interaction 
between GL50 and its target molecule, without the labeling of any of the interactants. For example, a microphysiometer 
can be used to detect the interaction of GL50 with its target molecule without the labeling of either GL50 or the target 
molecule. McConnell, H. M. et al. (1 992) Science 257:1 906-1 912. As used herein, a "microphysiometer" (e.g., Cytosen- 
sor) is an analytical instrument that measures the rate at which a cell acidifies its environment using a light-addressable 
potentiometric sensor (LAPS). Changes in this acidification rate can be used as an indicator of the interaction between 
compound and receptor. 

[0254] In a preferred embodiment, determining the ability of the GL50 polypeptide to bind to or interact with a GL50 
binding partner can be accomplished by determining the activity of the binding partner. For example, the activity of the 
target molecule can be determined by detecting induction of a cellular second messenger of the target (e.g., to phos- 
phorylate GL50 or another substrate on tyrosine residues), detecting catalytic/enzymatic activity of the target an appro- 
priate substrate, detecting the induction of a reporter gene (comprising a target-responsive regulatory element operatively 
linked to a nucleic acid encoding a detectable marker, e.g., chloramphenicol acetyl transferase), or detecting a target- 
regulated cellular response. For example, determining the ability of the GL50 polypeptide to bind to or interact with a 
GL50 target molecule can be accomplished, for example, by measuring the ability of a compound to downmodulate T 
cell costimulation in a proliferation assay, or by interfering with the ability of a GL50 polypeptide to bind to antibodies 
that recognize a portion of the GL50 polypeptide. 

[0255] In yet another embodiment, an assay of the present invention is a cell-free assay in which a GL50 polypeptide 
or biologically active portion thereof is contacted with a test compound and the ability of the test compound to bind to 
the GL50 polypeptide or biologically active portion thereof is determined. Binding of the test compound to the GL50 
polypeptide can be determined either directly or indirectly as described above. In a preferred embodiment, the assay 
includes contacting the GL50 polypeptide or biologically active portion thereof with a known compound which binds 
GL50 to form an assay mixture, contacting the assay mixture with a test compound, and determining the ability of the 
test compound to interact with a GL50 polypeptide, wherein determining the ability of the test compound to interact with 
a GL50 polypeptide comprises determining the ability of the test compound to preferentially bind to GL50 polypeptide 
or biologically active portion thereof as compared to the known compound. 

[0256] In another embodiment, the assay is a cell-free assay in which a GL50 polypeptide or biologically active portion 
thereof is contacted with a test compound and the ability of the test compound to modulate (e.g., stimulate or inhibit) 
the activity of the GL50 polypeptide or biologically active portion thereof is determined. Determining the ability of the test 
compound to modulate the activity of a GL50 polypeptide can be accomplished, for example, by determining the ability 
of the GL50 polypeptide to bind to a GL50 target molecule or ligand by one of the methods described above for determining 
direct binding. Determining the ability of the GL50 polypeptide to bind to a GL50 target molecule can also be accomplished 
using a technology such as real-time Biomolecular Interaction Analysis (BIA). Sjolander, S. and Urbaniczky, C. (1991) 
Anal. Chem. 63:2338-2345 and Szabo et al. (1995) Curr. Opin. Struct. Biol. 5:699-705. As used herein, "BIA" is a 
technology for studying biospecific interactions in real time, without labeling any of the interactants (e.g., BIAcore). 
Changes in the optical phenomenon of surface plasmon resonance (SPR) can be used as an indication of real-time 
reactions between biological molecules. 

[0257] In an alternative embodiment, determining the ability of the test compound to modulate the activity of a GL50 
polypeptide can be accomplished by determining the ability of the GL50 polypeptide to further modulate the activity of 
a GL50 target molecule (e.g., a GL50 mediated signal transduction pathway component). For example, the activity of 
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the effector molecule on an appropriate target can be determined, or the binding of the effector to an appropriate target 
can be determined as previously described. 

[0258] In yet another embodiment, the cell-free assay involves contacting a GL50 polypeptide or biologically active 
portion thereof with a known compound which binds the GL50 polypeptide to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the ability of the test compound to interact with the GL50 polypeptide, 
wherein determining the ability of the test compound to interact with the GL50 polypeptide comprises determining the 
ability of the GL50 polypeptide to preferentially bind to or modulate the activity of a GL50 target molecule. 
[0259] The cell-free assays of the present invention are amenable to use of both soluble and/or membrane-bound 
forms of proteins (e.g., GL50 polypeptides or biologically active portions thereof, or receptors to which GL50 binds). In 
the case of cell-free assays in which a membrane-bound form a protein is used (e.g., a cell surface GL50 receptor) it 
may be desirable to utilize a solubilizing agent such that the membrane-bound form of the protein is maintained in 
solution. Examples of such solubilizing agents include non-ionicdetergentssuch as n-octylglucoside, n-dodecylglucoside, 
n-dodecylmattoside, octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton® X-100, Triton® X-114, Thes- 
it®, lsotridecypoly(ethylene glycol ether) n , 3-[(3-cholamidopropyl)dimethylamminio]-1 -propane sulfonate (CHAPS), 
3-[(3-cholamidopropyl)dimethylamminio]-2-hydroxy-1 -propane sulfonate (CHAPSO), or N-dodecyl=N,N-dimethyl-3-am- 
monio-1 -propane sulfonate. 

[0260] In one embodiment of the above assay methods of the present invention, it may be desirable to immobilize 
either GL50 or its target molecule to facilitate separation of complexed from uncomplexed forms of one or both of the 
proteins, as well as to accommodate automation of the assay. Binding of a test compound to a GL50 polypeptide, or 
interaction of a GL50 polypeptide with a target molecule in the presence and absence of a candidate compound, can 
be accomplished in any vessel suitable for containing the reactants. Examples of such vessels include microtitre plates, 
test tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein can be provided which adds a domain that 
allows one or both of the proteins to be bound to a matrix. For example, glutathione-S-transferase/ GL50 fusion proteins 
or glutathione-S-transferase/target fusion proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, 
St. Louis, MO) or glutathione derivatized microtiter plates, which are then combined with the test compound or the test 
compound and either the non-adsorbed target protein or GL50 polypeptide, and the mixture incubated under conditions 
conducive to complex formation (e.g., at physiological conditions for salt and pH). Following incubation, the beads or 
microtiter plate wells are washed to remove any unbound components, the matrix immobilized in the case of beads, 
complex determined either directly or indirectly, for example, as described above. Alternatively, the complexes can be 
dissociated from the matrix, and the level of GL50 binding or activity determined using standard techniques. 
[0261 ] Other techniques for immobilizing proteins on matrices can also be used in the screening assays of the invention. 
For example, either a GL50 polypeptide or a GL50 target molecule can be immobilized utilizing conjugation of biotin and 
streptavidin. Biotinylated GL50 polypeptide or target molecules can be prepared from biotin-NHS (N-hydroxy-succinim- 
ide) using techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and immobilized in 
the wells of streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies reactive with GL50 polypeptide 
or target molecules but which do not interfere with binding of the GL50 polypeptide to its target molecule can be derivatized 
to the wells of the plate, and unbound target or GL50 polypeptide trapped in the wells by antibody conjugation. Methods 
for detecting such complexes, in addition to those described above for the GST-immobilized complexes, include immu- 
nodetection of complexes using antibodies reactive with the GL50 polypeptide or target molecule, as well as enzyme- 
linked assays which rely on detecting an enzymatic activity associated with the GL50 polypeptide or target molecule. 
[0262] In another embodiment, modulators of GL50 expression are identified in a method wherein a cell is contacted 
with a candidate compound and the expression of GL50 mRNA or protein in the cell is determined. The level of expression 
of GL50 mRNA or protein in the presence of the candidate compound is compared to the level of expression of GL50 
mRNA or protein in the absence of the candidate compound. The candidate compound can then be identified as a 
modulator of GL50 expression based on this comparison. For example, when expression of GL50 mRNA or protein is 
greater (e.g., statistically significantly greater) in the presence of the candidate compound than in its absence, the 
candidate compound is identified as a stimulator of GL50 mRNA or protein expression. Alternatively, when expression 
of GL50 mRNA or protein is less (e.g., statistically significantly less) in the presence of the candidate compound than 
in its absence, the candidate compound is identified as an inhibitor of GL50 mRNA or protein expression. The level of 
GL50 mRNA or protein expression in the cells can be determined by methods described herein for detecting GL50 
mRNA or protein. 

[0263] In yet another aspect of the invention, the GL50 polypeptides, e.g., soluble or membrane bound molecules or 
portions thereof (e.g., transmembrane or cytoplasmic portions), can be used as "bait proteins" in a two-hybrid assay or 
three-hybrid assay (see, e.g., U.S. Patent No. 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. 
Biol. Chem. 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi et al. (1993) Oncogene 8: 
1693-1696; and Brent WO94/10300), to identify other proteins, which bind to or interact with GL50 ("GL50-binding 
proteins" or "GL50-bp") and are involved in GL50 activity. Such GL50-binding proteins are also likely to be involved in 
the propagation of signals by the GL50 polypeptides or GL50 targets as, for example, downstream elements of a GL50- 
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mediated signaling pathway. Alternatively, such GL50-binding proteins may be GL50 inhibitors. 

[0264] The two-hybrid system is based on the modular nature of most transcription factors, which consist of separable 
DNA-binding and activation domains. Briefly, the assay utilizes two different DNA constructs. In one construct, the gene 
that codes for a GL50 polypeptide is fused to a gene encoding the DNA binding domain of a known transcription factor 

5 (e.g., GAL-4). In the other construct, a DNA sequence, from a library of DNA sequences, that encodes an unidentified 
protein ("prey" or "sample") is fused to a gene that codes for the activation domain of the known transcription factor. If 
the "bait" and the "prey" proteins are able to interact, In vivo, forming a GL50-dependent complex, the DNA-binding and 
activation domains of the transcription factor are brought into close proximity. This proximity allows transcription of a 
reporter gene (e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive to the transcription 

10 factor. Expression of the reporter gene can be detected and cell colonies containing the functional transcription factor 
can be isolated and used to obtain the cloned gene which encodes the protein which interacts with the GL50 polypeptide. 
[0265] This invention further pertains to novel agents identified by the above-described screening assays. Accordingly, 
it is within the scope of this invention to further use an agent identified as described herein in an appropriate animal 
model. For example, an agent identified as described herein (e.g., a GL50 modulating agent, an antisense GL50 nucleic 

15 acid molecule, a GL50-specific antibody, or a GL50 -binding partner) can be used in an animal model to determine the 
efficacy, toxicity, or side effects of treatment with such an agent. Alternatively, an agent identified as described herein 
can be used in an animal model to determine the mechanism of action of such an agent. Furthermore, this invention 
pertains to uses of novel agents identified by the above-described screening assays for treatments as described herein. 

20 f. Detection Assays 

[0266] Portions or fragments of the cDNA sequences identified herein (and the corresponding complete gene se- 
quences) can be used in numerous ways as polynucleotide reagents. For example, these sequences can be used to: 
(i) map their respective genes on a chromosome; and, thus, locate gene regions associated with genetic disease; (ii) 
25 identify an individual from a minute biological sample (tissue typing); and (iii) aid in forensic identification of a biological 
sample. These applications are described in the subsections below. 

1. Chromosome Mapping 

30 [0267] GL50 has been mapped to human chromosome 21 q22. Accordingly, portions or fragments of GL50 nucleotide 
sequences (both coding and non-coding), described herein, can be used to correlate these sequences with genes 
associated with disease. 

[0268] The physical position of a sequence on the chromosome can be correlated with genetic map data. (Such data 
are found, for example, in V. McKusick, Mendelian Inheritance in Man, available on-line through Johns Hopkins University 
35 Welch Medical Library). The relationship between a gene and a disease, mapped to the same chromosomal region, can 
then be identified through linkage analysis (co-inheritance of physically adjacent genes), described in, for example, 
Egeland, J. et al. (1987) Nature, 325:783-787. 

[0269] Moreover, differences in the DNA sequences between individuals affected and unaffected with a disease 
associated with the GL50 gene, can be determined. If a mutation is observed in some or all of the affected individuals 
^o but not in any unaffected individuals, then the mutation is likely to be the causative agent of the particular disease. 
Comparison of affected and unaffected individuals generally involves first looking for structural alterations in the chro- 
mosomes, such as deletions or translocations that are visible from chromosome spreads or detectable using PCR based 
on that DNA sequence. Ultimately, complete sequencing of genes from several individuals can be performed to confirm 
the presence of a mutation and to distinguish mutations from polymorphisms. 

45 

2. Tissue Typing 

[0270] The GL50 sequences of the present invention can also be used to identify individuals from minute biological 
samples. The United States military, for example, is considering the use of restriction fragment length polymorphism 
50 (RFLP) for identification of its personnel. In this technique, an individual's genomic DNA is digested with one or more 
restriction enzymes, and probed on a Southern blot to yield unique bands for identification. This method does not suffer 
from the current limitations of "Dog Tags" which can be lost, switched, or stolen, making positive identification difficult. 
The sequences of the present invention are useful as additional DNA markers for RFLP (described in U.S. Patent 
5,272,057). 

55 [0271] Furthermore, the sequences of the present invention can be used to provide an alternative technique which 
determines the actual base-by-base DNA sequence of selected portions of an individual's genome. Thus, the GL50 
nucleotide sequences described herein can be used to prepare two PCR primers from the 5' and 3' ends of the sequences. 
These primers can then be used to amplify an individual's DNA and subsequently sequence it. 
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[0272] Panels of corresponding DNA sequences from individuals, prepared in this manner, can provide unique indi- 
vidual identifications, as each individual will have a unique set of such DNA sequences due to allelic differences. The 
sequences of the present invention can be used to obtain such identification sequences from individuals and from tissue. 
The GL50 nucleotide sequences of the invention uniquely represent portions of the human genome. Allelic variation 

5 occurs to some degree in the coding regions of these sequences, and to a greater degree in the noncoding regions. It 
is estimated that allelic variation between individual humans occurs with a frequency of about once per each 500 bases. 
Each of the sequences described herein can, to some degree, be used as a standard against which DNA from an 
individual can be compared for identification purposes. Because greater numbers of polymorphisms occur in the non- 
coding regions, fewer sequences are necessary to differentiate individuals. The noncoding sequences af SEQ ID NO: 

10 1 5 3, or 5, can comfortably provide positive individual identification with a panel of primers which each yield a noncoding 
amplified sequence of 100 bases. If predicted coding sequences are used, a more appropriate number of primers for 
positive individual identification would be 500-2,000. 

[0273] If a panel of reagents from GL50 nucleotide sequences described herein is used to generate a unique identi- 
fication database for an individual, those same reagents can later be used to identify tissue from that individual. Using 
15 the unique identification database, positive identification of the individual, living or dead, can be made from extremely 
small tissue samples. 

3. Use of Partial GL50 Sequences in Forensic Biology 

20 [0274] DNA-based identification techniques can also be used in forensic biology. Forensic biology is a scientific field 
employing genetic typing of biological evidence found at a crime scene as a means for positively identifying, for example, 
a perpetrator of a crime. To make such an identification, PCR technology can be used to amplify DNA sequences taken 
from very small biological samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, or semen found 
at a crime scene. The amplified sequence can then be compared to a standard, thereby allowing identification of the 
25 origin of the biological sample. 

[0275] The sequences of the present invention can be used to provide polynucleotide reagents, e.g., PCR primers, 
targeted to specific loci in the human genome, which can enhance the reliability of DNA-based forensic identifications 
by, for example, providing another "identification marker" (i.e. another DNA sequence that is unique to a particular 
individual). As mentioned above, actual base sequence information can be used for identification as an accurate alter- 
so native to patterns formed by restriction enzyme generated fragments. Sequences targeted to noncoding regions are 
particularly appropriate for this use as greater numbers of polymorphisms occur in the noncoding regions, making it 
easier to differentiate individuals using this technique. Examples of polynucleotide reagents include the GL50 nucleotide 
sequences or portions thereof having a length of at least 20 bases, preferably at least 30 bases. 
[0276] The GL50 nucleotide sequences described herein can further be used to provide polynucleotide reagents, e.g., 
35 labeled or labelable probes which can be used in, for example, an in situ hybridization technique, to identify a specific 
tissue, e.g., brain tissue. This can be very useful in cases where a forensic pathologist is presented with a tissue of 
unknown origin. Panels of such GL50 probes can be used to identify tissue by species and/or by organ type. 
[0277] In a similar fashion, these reagents, e.g., GL50 primers or probes can be used to screen tissue culture for 
contamination (i.e, screen for the presence of a mixture of different types of cells in a culture). 

40 

G. Predictive Medicine: 

[0278] The present invention also pertains to the field of predictive medicine in which diagnostic assays, prognostic 
assays, and monitoring clinical trials are used for prognostic (predictive) purposes to thereby treat an individual prophy- 

45 lactically. Accordingly, one aspect of the present invention relates to diagnostic assays for determining GL50 polypeptide 
and/or nucleic acid expression as well as GL50 activity, in the context of a biological sample (e.g., blood, serum, cells, 
tissue) to thereby determine whether an individual is afflicted with a disease or disorder, or is at risk of developing a 
disorder, associated with aberrant GL50 expression or activity. The invention also provides for prognostic (or predictive) 
assays for determining whether an individual is at risk of developing a disorder associated with GL50 polypeptide, nucleic 

50 acid expression or activity. For example, mutations in a GL50 gene can be assayed in a biological sample. Such assays 
can be used for prognostic or predictive purpose to thereby prophylactically treat an individual prior to the onset of a 
disorder characterized by or associated with GL50 polypeptide, nucleic acid expression or activity. 
[0279] Another aspect of the invention pertains to monitoring the influence of agents (e.g., drugs, compounds) on the 
expression or activity of GL50 in clinical trials. 

55 [0280] These and other agents are described in further detail in the following sections. 
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1 . Diagnostic Assays 

[0281 ] An exemplary method for detecting the presence or absence of GL50 polypeptide or nucleic acid in a biological 
sample involves obtaining a biological sample from a test subject and contacting the biological sample with a compound 

5 or an agent capable of detecting GL50 polypeptide or nucleic acid (e.g., mRNA, genomic DNA) that encodes GL50 
polypeptide such that the presence of GL50 polypeptide or nucleic acid is detected in the biological sample. A preferred 
agent for detecting GL50 mRNA or genomic DNA is a labeled nucleic acid probe capable of hybridizing to GL50 mRNA 
or genomic DNA. The nucleic acid probe can be, for example, a hGL50 nucleic acid, such as the nucleic acid of SEQ 
ID NO: 1, 3, or 5, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in 

10 length and sufficient to specifically hybridize under stringent conditions to GL50 mRNA or genomic DNA. Other suitable 
probes for use in the diagnostic assays of the invention are described herein. 

[0282] A preferred agent for detecting GL50 polypeptide is an antibody capable of binding to GL50 polypeptide, 
preferably an antibody with a detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An intact 
antibody, or a fragment thereof (e.g., Fab or F(ab') 2 ) can be used. The term "labeled", with regard to the probe or antibody, 

15 is intended to encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) a detectable 
substance to the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity with another reagent 
that is directly labeled. Examples of indirect labeling include detection of a primary antibody using a fluorescently labeled 
secondary antibody and end-labeling of a DNA probe with biotin such that it can be detected with fluorescently labeled 
streptavidin. The term "biological sample" is intended to include tissues, cells and biological fluids isolated from a subject, 

20 as well as tissues, cells and fluids present within a subject. That is, the detection method of the invention can be used 
to detect GL50 mRNA, protein, or genomic DNA in a biological sample in vitro as well as in vivo. For example, in vitro 
techniques for detection of GL50 mRNA include Northern hybridizations and in situ hybridizations, in vitro techniques 
for detection of GL50 polypeptide include enzyme linked immunosorbent assays (ELISAs), Western blots, immunopre- 
cipitations and immunofluorescence. In vitro techniques for detection of GL50 genomic DNA include Southern hybridi- 

25 zations. Furthermore, in vivo techniques for detection of GL50 polypeptide include introducing into a subject a labeled 
anti-GL50 antibody. For example, the antibody can be labeled with a radioactive marker whose presence and location 
in a subject can be detected by standard imaging techniques. 

[0283] In one embodiment, the biological sample contains protein molecules from the test subject. Alternatively, the 
biological sample can contain mRNA molecules from the test subject or genomic DNA molecules from the test subject. 

30 a preferred biological sample is a serum sample isolated by conventional means from a subject. 

[0284] In another embodiment, the methods further involve obtaining a control biological sample from a control subject, 
contacting the control sample with a compound or agent capable of detecting GL50 polypeptide, mRNA, or genomic 
DNA, such that the presence of GL50 polypeptide, mRNA or genomic DNA is detected in the biological sample, and 
comparing the presence of GL50 polypeptide, mRNA or genomic DNA in the control sample with the presence of GL50 

35 polypeptide, mRNA or genomic DNA in the test sample. 

[0285] The invention also encompasses kits for detecting the presence of GL50 in a biological sample. For example, 
the kit can comprise a labeled compound or agent capable of detecting GL50 polypeptide or mRNA in a biological 
sample; means for determining the amount of GL50 in the sample; and means for comparing the amount of GL50 in the 
sample with a standard. The compound or agent can be packaged in a suitable container. The kit can further comprise 

40 instructions for using the kit to detect GL50 polypeptide or nucleic acid. 

2. Prognostic Assays 

[0286] The diagnostic methods described herein can furthermore be utilized to identify subjects having or at risk of 
45 developing a disease or disorder associated with aberrant GL50 expression or activity. For example, the assays described 
herein, such as the preceding diagnostic assays or the following assays, can be utilized to identify a subject having or 
at risk of developing a disorder associated with GL50 polypeptide, nucleic acid expression or activity. Thus, the present 
invention provides a method for identifying a disease or disorder associated with aberrant GL50 expression or activity 
in which a test sample is obtained from a subject and GL50 polypeptide or nucleic acid (e.g., mRNA, genomic DNA) is 
50 detected, wherein the presence of GL50 polypeptide or nucleic acid is diagnostic for a subject having or at risk of 
developing a disease or disorder associated with aberrant GL50 expression or activity. As used herein, a "test sample" 
refers to a biological sample obtained from a subject of interest. For example, a test sample can be a biological fluid 
(e.g., serum), cell sample, or tissue. 

[0287] Furthermore, the prognostic assays described herein can be used to determine whether an agent (e.g., an 
55 agonist, antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) can be 
administered to a subject to treat a disease or disorder associated with aberrant GL50 expression or activity. Thus, the 
present invention provides methods for determining whether a subject can be effectively treated with an agent for a 
disorder associated with aberrant GL50 expression or activity in which a test sample is obtained and GL50 polypeptide 



42 



EP 1 900 816 A2 



or nucleic acid expression or activity is detected (e.g., wherein the abundance of GL50 polypeptide or nucleic acid 
expression or activity is diagnostic for a subject that can be administered the agent to treat a disorder associated with 
aberrant GL50 expression or activity). 

[0288] The methods of the invention can also be used to detect genetic alterations in a GL50 gene, thereby determining 
if a subject with the altered gene is at risk for a disorder associated with the GL50 gene. In preferred embodiments, the 
methods include detecting, in a sample of cells from the subject, the presence or absence of a genetic alteration char- 
acterized by at least one of an alteration affecting the integrity of a gene encoding a GL50 -protein, or the mis-expression 
of the GL50 gene. For example, such genetic alterations can be detected by ascertaining the existence of at least one 
of 1 ) a deletion of one or more nucleotides from a GL50 gene; 2) an addition of one or more nucleotides to a GL50 gene; 
3) a substitution of one or more nucleotides of a GL50 gene, 4) a chromosomal rearrangement of a GL50 gene; 5) an 
alteration in the level of a messenger RNA transcript of a GL50 gene, 6) aberrant modification of a GL50 gene, such as 
of the methylation pattern of the genomic DNA, 7) the presence of a non-wild type splicing pattern of a messenger RNA 
transcript of a GL50 gene, 8) a non-wild type level of a GL50 polypeptide, 9) allelic loss of a GL50 gene, and 10) 
inappropriate post-translational modification of a GL50 polypeptide. As described herein, there are a large number of 
assay techniques known in the art which can be used for detecting alterations in a GL50 gene. A preferred biological 
sample is a tissue or serum sample isolated by conventional means from a subject, e.g., a cardiac tissue sample. 
[0289] In certain embodiments, detection of the alteration involves the use of a probe/primer in a polymerase chain 
reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 4,683,202), such as anchor PCR or RACE PCR, or, alterna- 
tively, in a ligation chain reaction (LCR) (see, e.g., Landegran et al. (1988) Science 241:1077-1080; and Nakazawa et 
al. (1994) Proc. Natl. Acad. Sci. USA 91:360-364), the latter of which can be particularly useful for detecting point 
mutations in the GL50 gene (see Abravaya et al. (1995) Nucleic Acids Res. 23:675-682). This method can include the 
steps of collecting a sample of cells from a patient, isolating nucleic acid {e.g., genomic, mRNA or both) from the cells 
of the sample, contacting the nucleic acid sample with one or more primers which specifically hybridize to a GL50 gene 
under conditions such that hybridization and amplification of the GL50 gene (if present) occurs, and detecting the 
presence or absence of an amplification product, or detecting the size of the amplification product and comparing the 
length to a control sample. It is anticipated that PCR and/or LCR may be desirable to use as a preliminary amplification 
step in conjunction with any of the techniques used for detecting mutations described herein. 

[0290] Alternative amplification methods include: self sustained sequence replication (Guatelli, J.C. et al., (1 990) Proc. 
Natl. Acad. Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh, D.Y. et al., (1989) Proc. Natl. Acad. 
Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi, P.M. et al. (1988) Bio-Technology 6:1197), or any other nucleic 
acid amplification method, followed by the detection of the amplified molecules using techniques well known to those of 
skill in the art. These detection schemes are especially useful for the detection of nucleic acid molecules if such molecules 
are present in very low numbers. 

[0291] In an alternative embodiment, mutations in a GL50 gene from a sample cell can be identified by alterations in 
restriction enzyme cleavage patterns. For example, sample and control DNA is isolated, amplified (optionally), digested 
with one or more restriction endonucleases, and fragment length sizes are determined by gel electrophoresis and 
compared. Differences in fragment length sizes between sample and control DNA indicates mutations in the sample 
DNA. Moreover, the use of sequence specific ribozymes (see, for example, U.S. Patent No. 5,498,531) can be used to 
score for the presence of specific mutations by development or loss of a ribozyme cleavage site. 
[0292] In other embodiments, genetic mutations in GL50 can be identified by hybridizing a sample and control nucleic 
acids, e.g., DNA or RNA, to high density arrays containing hundreds or thousands of oligonucleotides probes (Cronin, 
M.T. et al. (1996) Human Mutation 7: 244-255; Kozal, M.J. et al. (1996) Nature Medicine 2: 753-759). For example, 
genetic mutations in GL50 can be identified in two dimensional arrays containing light-generated DNA probes as described 
in Cronin, M.T. et al. supra. Briefly, a first hybridization array of probes can be used to scan through long stretches of 
DNA in a sample and control to identify base changes between the sequences by making linear arrays of sequential 
overlapping probes. This step allows the identification of point mutations. This step is followed by a second hybridization 
array that allows the characterization of specific mutations by using smaller, specialized probe arrays complementary 
to all variants or mutations detected. Each mutation array is composed of parallel probe sets, one complementary to the 
wild-type gene and the other complementary to the mutant gene. 

[0293] In yet another embodiment, any of a variety of sequencing reactions known in the art can be used to directly 
sequence the GL50 gene and detect mutations by comparing the sequence of the sample GL50 with the corresponding 
wild-type (control) sequence. Examples of sequencing reactions include those based on techniques developed by Maxam 
and Gilbert ((1977) Proc. Natl. Acad. Sci. USA 74:560) or Sanger ((1977) Proc. Natl. Acad. Sci. USA74:5463). It is also 
contemplated that any of a variety of automated sequencing procedures can be utilized when performing the diagnostic 
assays ((1995) Biotechniques 19:448), including sequencing by mass spectrometry (see, e.g., PCT International Pub- 
lication No. WO 94116101 ; Cohen et al. (1996) Adv. Chromatogr. 36:127-162; and Griffin et al. (1993) Appl. Biochem. 
Biotechnol. 38:147-159). 

[0294] Other methods for detecting mutations in the GL50 gene include methods in which protection from cleavage 
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agents is used to detect mismatched bases in RNA/RNA or RNA/DNA heteroduplexes (Myers et al. (1 985) Science 230: 
1242). In general, the art technique of "mismatch cleavage" starts by providing heteroduplexes formed by hybridizing 
(labeled) RNA or DNA containing the wild-type GL50 sequence with potentially mutant RNA or DNA obtained from a 
tissue sample. The double-stranded duplexes are treated with an agent which cleaves single-stranded regions of the 
duplex such as which will exist due to basepair mismatches between the control and sample strands. For instance, 
RNA/DNA duplexes can be treated with RNase and DNA/DNA hybrids treated with S 1 nuclease to enzymatically 
digesting the mismatched regions. In other embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with 
hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched regions. After digestion of the 
mismatched regions, the resulting material is then separated by size on denaturing polyacrylamide gels to determine 
the site of mutation. See, for example, Cotton et al. (1988) Proc. Natl. Acad. Sci. USA 85:4397; Saleeba et al. (1992) 
Methods Enzymol. 217:286-295. In a preferred embodiment, the control DNA or RNA can be labeled for detection. 
[0295] In still another embodiment, the mismatch cleavage reaction employs one or more proteins that recognize 
mismatched base pairs in double-stranded DNA (so called "DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in GL50s obtained from samples of cells. For example, the mutY enzyme of E, 
coli cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T at G/T mismatches 
(Hsu et al. (1994) Carcinogenesis 15:1657-1662). According to an exemplary embodiment, a probe based on a GL50 
sequence, e.g., a wild-type GL50 sequence, is hybridized to a cDNA or other DNA product from a testcell(s). The duplex 
is treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be detected from electrophoresis 
protocols or the like. See, for example, U.S. Patent No. 5,459,039. 

[0296] In other embodiments, alterations in electrophoretic mobility will be used to identify mutations in GL50 genes. 
For example, single strand conformation polymorphism (SSCP) may be used to detect differences in electrophoretic 
mobility between mutant and wild type nucleic acids (Orita et al. (1989) Proc. Natl. Acad. Sci. USA: 86:2766, see also 
Cotton (1993) Mutat. Res. 285:125-144; and Hayashi (1992) Genet. Anal. Tech. Appl. 9:73-79). Single-stranded DNA 
fragments of sample and control GL50 nucleic acids will be denatured and allowed to renature. The secondary structure 
of single-stranded nucleic acids varies according to sequence, the resulting alteration in electrophoretic mobility enables 
the detection of even a single base change. The DNA fragments may be labeled or detected with labeled probes. The 
sensitivity of the assay may be enhanced by using RNA (rather than DNA), in which the secondary structure is more 
sensitive to a change in sequence. In a preferred embodiment, the subject method utilizes heteroduplex analysis to 
separate double stranded heteroduplex molecules on the basis of changes in electrophoretic mobility (Keen et al. (1 991 ) 
Trends Genet 7:5,1 

[0297] In yet another embodiment the movement of mutant or wild-type fragments in polyacrylamide gels containing 
a gradient of denaturant is assayed using denaturing gradient gel electrophoresis (DGGE) (Myers et al. (1985) Nature 
313:495). When DGGE is used as the method of analysis, DNA will be modified to insure that it does not completely 
denature, for example by adding a GC clamp of approximately 40 bp of high-melting GC-rich DNA by PCR. In a further 
embodiment, a temperature gradient is used in place of a denaturing gradient to identify differences in the mobility of 
control and sample DNA (Rosenbaum and Reissner (1987) Biophys. Chem. 265:12753). 

[0298] Examples of other techniques for detecting point mutations include, but are not limited to, selective oligonu- 
cleotide hybridization, selective amplification, or selective primer extension. For example, oligonucleotide primers may 
be prepared in which the known mutation is placed centrally and then hybridized to target DNA under conditions which 
permit hybridization only if a perfect match is found (Saiki et al. (1986) Nature 324:163); Saiki et al. (1989) Proc. Natl. 
Acad. Sci. USA 86:6230). Such allele specific oligonucleotides are hybridized to PCR amplified target DNA or a number 
of different mutations when the oligonucleotides are attached to the hybridizing membrane and hybridized with labeled 
target DNA. 

[0299] Alternatively, allele specific amplification technology which depends on selective PCR amplification may be 
used in conjunction with the instant invention. Oligonucleotides used as primers for specific amplification may carry the 
mutation of interest in the center of the molecule (so that amplification depends on differential hybridization) (Gibbs et 
al. (1 989) Nucleic Acids Res. 1 7:2437-2448) or at the extreme 3' end of one primer where, under appropriate conditions, 
mismatch can prevent, or reduce polymerase extension (Prossner et al. (1993) Tibtech 1 1 :238). In addition it may be 
desirable to introduce a novel restriction site in the region of the mutation to create cleavage-based detection (Gasparini 
et al. (1992) Mol. Cell Probes 6:1). It is anticipated that in certain embodiments amplification may also be performed 
using Taq ligase for amplification (Barany (1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, ligation will occur 
only if there is a perfect match at the 3' end of the 5' sequence making it possible to detect the presence of a known 
mutation at a specific site by looking for the presence or absence of amplification. 

[0300] The methods described herein may be performed, for example, by utilizing prepackaged diagnostic kits com- 
prising at least one probe nucleic acid or antibody reagent described herein, which may be conveniently used, e.g., in 
clinical settings to diagnose patients exhibiting symptoms or family history of a disease or illness involving a GL50 gene. 
[0301] Furthermore, any cell type or tissue in which GL50 is expressed may be utilized in the prognostic assays 
described herein. 
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VII. Administration of GL50 Modulating Agents 

[0302] GL50 modulating agents of the invention are administered to subjects in a biologically compatible form suitable 
for pharmaceutical administration in wVoto either enhance or suppress T cell mediated immune response. By "biologically 

5 compatible form suitable for administration in vivo" is meant a form of the protein to be administered in which any toxic 
effects are outweighed by the therapeutic effects of the protein. The term subject is intended to include living organisms 
in which an immune response can be elicited, e.g., mammals. Examples of subjects include humans, dogs, cats, mice, 
rats, and transgenic species thereof. Administration of an agent as described herein can be in any pharmacological form 
including a therapeutically active amount of an agent alone or in combination with a pharmaceutical^ acceptable carrier. 

10 [0303] Administration of a therapeutically active amount of the therapeutic compositions of the present invention is 
defined as an amount effective, at dosages and for periods of time necessary to achieve the desired result. For example, 
a therapeutically active amount of a GL50 modulating agent may vary according to factors such as the disease state, 
age, sex, and weight of the individual, and the ability of peptide to elicit a desired response in the individual. Dosage 
regima may be adjusted to provide the optimum therapeutic response. For example, several divided doses may be 

15 administered daily or the dose may be proportionally reduced as indicated by the exigencies of the therapeutic situation. 
[0304] The GL50 modulating agent {e.g., a peptide, a nucleic acid molecule, or an antibody) may be administered in 
a convenient manner such as by injection (subcutaneous, intravenous, etc.), oral administration, inhalation, transdermal 
application, or rectal administration. Depending on the route of administration, the active compound may be coated in 
a material to protect the compound from the action of enzymes, acids and other natural conditions which may inactivate 

20 the compound. For example, to administer GL50 modulating agent by other than parenteral administration, it may be 
necessary to coat the peptide with, or co-administer the peptide with, a material to prevent its inactivation. 
[0305] A GL50 modulating agent may be administered to an individual in an appropriate carrier, diluent or adjuvant, 
co-administered with enzyme inhibitors or in an appropriate carrier such as liposomes. Pharmaceutically acceptable 
diluents include saline and aqueous buffer solutions. Adjuvant is used in its broadest sense and includes any immune 

25 stimulating compound such as interferon. Adjuvants contemplated herein include resorcinols, non-ionic surfactants such 
as polyoxyethyleneoleyl ether and n-hexadecyl polyethylene ether. Enzyme inhibitors include pancreatic trypsin inhibitor, 
diisopropylfluorophosphate (DEEP) and trasylol. Liposomes include water-in-oil-in-water emulsions as well as conven- 
tional liposomes (Sterna et al., (1984) J. Neuroimmunol 7:27). 

[0306] The active compound may also be administered parenterally or intraperitoneal^. Dispersions can also be 

30 prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of storage 
and use, these preparations may contain a preservative to prevent the growth of microorganisms. 
[0307] Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions (where water soluble) 
or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. In 
all cases, the composition must be sterile and must be fluid to the extent that easy syringeability exists. It must be stable 

35 under the conditions of manufacture and storage and must be preserved against the contaminating action of microor- 
ganisms such as bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for example, water, 
ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyetheylene glycol, and the like), and suitable 
mixtures thereof. The proper fluidity can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of surfactants. Prevention of the action 

40 of microorganisms can be achieved by various antibacterial and antifungal agents, for example, parabens, chlorobutanol, 
phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, for example, 
sugars, polyalcohols such as manitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable 
compositions can be brought about by including in the composition an agent which delays absorption, for example, 
aluminum monostearate and gelatin. 

45 [0308] Sterile injectable solutions can be prepared by incorporating active compound (e.g., a GL50 polypeptide or 
anti-GL50 antibody) in the required amount in an appropriate solvent with one or a combination of ingredients enumerated 
above, as required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the active 
compound into a sterile vehicle which contains a basic dispersion medium and the required other ingredients from those 
enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, the preferred methods 

50 of preparation are vacuum drying and freeze-drying which yields a powder of the active ingredient (e.g., peptide) plus 
any additional desired ingredient from a previously sterile-filtered solution thereof. 

[0309] When the active compound is suitably protected, as described above, the protein may be orally administered, 
for example, with an inert diluent or an assimilable edible carrier. As used herein "pharmaceutically acceptable carrier" 
includes any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption 
55 delaying agents, and the like. The use of such media and agents for pharmaceutically active substances is well known 
in the art. Except insofar as any conventional media or agent is incompatible with the active compound, use thereof in 
the therapeutic compositions is contemplated. Supplementary active compounds can also be incorporated into the 
compositions. 
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[031 0] It is especially advantageous to formulate parenteral compositions in dosage unit form for ease of administration 
and uniformity of dosage. Dosage unit form as used herein refers to physically discrete units suited as unitary dosages 
for the mammalian subjects to be treated; each unit containing a predetermined quantity of active compound calculated 
to produce the desired therapeutic effect in association with the required pharmaceutical carrier. The specification for 
the dosage unit forms of the invention are dictated by and directly dependent on (a) the unique characteristics of the 
active compound and the particular therapeutic effect to be achieved, and (b) the limitations inherent in the art of 
compounding such an active compound for the treatment of sensitivity in individuals. 

[031 1 ] In one embodiment of the present invention a therapeutically effective amount of an antibody to a GL50 polypep- 
tide is administered to a subject. As defined herein, a therapeutically effective amount of antibody (i.e., an effective 
dosage) ranges from about 0.001 to 30 mg/kg body weight, preferably about 0.01 to 25 mg/kg body weight, more 
preferably about 0.1 to 20 mg/kg body weight, and even more preferably about 1 to 1 0 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 
4 to 7 mg/kg, or 5 to 6 mg/kg body weight. The skilled artisan will appreciate that certain factors may influence the dosage 
required to effectively treat a subject, including but not limited to the severity of the disease or disorder, previous treat- 
ments, the general health and/or age of the subject, and other diseases present. Moreover, treatment of a subject with 
a therapeutically effective amount of an antibody can include a single treatment or, preferably, can include a series of 
treatments. In a preferred example, a subject is treated with antibody in the range of between about 0.1 to 20 mg/kg 
body weight, one time per week for between about 1 to 10 weeks, preferably between 2 to 8 weeks, more preferably 
between about 3 to 7 weeks, and even more preferably for about 4, 5, or 6 weeks. It will also be appreciated that the 
effective dosage of antibody used for treatment may increase or decrease over the course of a particular treatment. 
Changes in dosage may result from the results of diagnostic assays as described herein. 

[031 2] Monitoring the influence of agents (e.g., drugs or compounds) on the expression or activity of a GL50 polypeptide 
can be applied not only in basic drug screening, but also in clinical trials. For example, the effectiveness of an agent 
determined by a screening assay as described herein to increase GL50 gene expression, protein levels, or upregulate 
GL50 activity, can be monitored in clinical trials of subjects exhibiting decreased GL50 gene expression, protein levels, 
or downregulated GL50 activity. Alternatively, the effectiveness of an agent determined by a screening assay to decrease 
GL50 gene expression, protein levels, or downregulate GL50 activity, can be monitored in clinical trials of subjects 
exhibiting increased GL50 gene expression, protein levels, or upregulated GL50 activity. In such clinical trials, the 
expression or activity of a GL50 gene, and preferably, other genes that have been implicated in a disorder can be used 
as a "read out" or markers of the phenotype of a particular cell. 

[031 3] For example, and not by way of limitation, genes, including GL50, that are modulated in cells by treatment with 
an agent (e.g., compound, drug or small molecule) which modulates GL50 activity (e.g., identified in a screening assay 
as described herein) can be identified. Thus, to study the effect of agents on a GL50 associated disorder, for example, 
in a clinical trial, cells can be isolated and RNA prepared and analyzed for the levels of expression of GL50 and other 
genes implicated in the GL50 associated disorder, respectively. The levels of gene expression (i.e., a gene expression 
pattern) can be quantified by Northern blot analysis or RT-PCR, as described herein, or alternatively by measuring the 
amount of protein produced, by one of the methods as described herein, or by measuring the levels of activity of GL50 
or other genes. In this way, the gene expression pattern can serve as a marker, indicative of the physiological response 
of the cells to the agent. Accordingly, this response state may be determined before, and at various points during 
treatment of the individual with the agent. 

[031 4] I n a preferred embodiment, the present invention provides a method for monitoring the effectiveness of treatment 
of a subject with an agent (e.g., an agonist, antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, 
or other drug candidate identified by the screening assays described herein) comprising the steps of (i) obtaining a pre- 
administration sample from a subject prior to administration of the agent; (ii) detecting the level of expression of a GL50 
polypeptide, mRNA, or genomic DNA in the pre-administration sample; (iii) obtaining one or more post-administration 
samples from the subject; (iv) detecting the level of expression or activity of the GL50 polypeptide, mRNA, or genomic 
DNA in the post-administration samples; (v) comparing the level of expression or activity of the GL50 polypeptide, mRNA, 
or genomic DNA in the pre-administration sample with the GL50 polypeptide, mRNA, or genomic DNA in the post 
administration sample or samples; and (vi) altering the administration of the agent to the subject accordingly. For example, 
increased administration of the agent may be desirable to increase the expression or activity of GL50 to higher levels 
than detected, i.e., to increase the effectiveness of the agent. Alternatively, decreased administration of the agent may 
be desirable to decrease expression or activity of GL50 to lower levels than detected, i.e. to decrease the effectiveness 
of the agent. According to such an embodiment, GL50 expression or activity may be used as an indicator of the effec- 
tiveness of an agent, even in the absence of an observable phenotypic response. 

[0315] This invention is further illustrated by the following examples which should not be construed as limiting. The 
contents of all references, patents and published patent applications cited throughout this application, as well as the 
Figures and the Sequence Listing are incorporated herein by reference. 
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EXAMPLES 

[0316] The following materials and methods were used in the examples. 

[0317] Mouse strain and RNA isolation: Mice (C57B1/6) injected with 1 0E5 MB49 bladder carcinoma cells were treated 

5 with 1 img/mouse recombinant IL12 on days 7-11 and 14-18. RNA was isolated from lymph nodes on days 9 (75%), 12 
(20%) and 19 (5%) were subsequently pooled. RNA was extracted using RNAStat 60 (teltest B) followed by poly A+ 
RNA enrichment using poly Attract magnetic isolation system (Promega). cDNAs were synthesized with superscript RT 
(Gibco BRL). Additional cDNA sources include a mouse fetal thymus library (C3H/Hej) and mouse peripheral blood 
lymphocytes derived from cardiac puncture of C57B1/6). 

10 [031 8] Signal trap: Signal trap protocols were followed as described by Jacobs et al. (1 997. Gene. 1 98: 289). Briefly, 
size fractionated cDNAs were unidirectionally cloned into the invertase expression plasmid pSUC2T7M130RI. An ex- 
pression library of plasmid clones was generated in E. coli and subsequently introduced into the invertase deficient suc2- 
strain of yeast. Signal trapped clones represented in the yeast library were selected by 2 day culture in YPR agar plates. 
Three hundred and thirty three clones were picked at random, miniprepped and sequenced. 

15 [0319] Sequence analysis: TBIastX, FastX, pFam, Pileup, GrowTree and Sigcleave of GCG Wisconsin package, and 
Geneworks 2.5.1 was used for DNA sequence manipulation, database searching and sequence analysis. In Figure 12, 
identity scores for pileup analysis was determined according to the following values: 1x pair = 1 ; 2x pair = 2; 3x pair = 
3; 3-of-a-kind = 4; 3-of-a-kind plus 1x pair = 5; 2x 3-of-a-kind = 6; 4-of-a-kind = 7; 4-of-a-kind plus 1x pair = 8; 5-of-a- 
kind = 9. The Lasergene DNAstar Genequest module was used for delineating intron-exon boundaries of hGL50 against 

20 Genbank Accession #HS21C098. Further analysis was performed with the SeqWeb Wisconsin GCG package using 
TFASTA, TBLASTN, ProfileScan. Distance-proportional phylograms were generated by GrowTree based on genetic 
distance using Kimura correction algorithms. Graphical output was subsequently reformatted to reflect family clusters. 
[0320] 3'rapid amplification of cDNA ends: 3' RACE was performed using primers (GL50) VL1 18 (CCCGCAGTCT- 
GCGCTCGCACC; SEQ ID NO:7), VL116 (GTCGACCCACCATGCAGCTAAAGTGTCCCTG; SEQ ID NO:8), 

25 (AB014553) VL141 (CGTGTACTGGATCAATAAGACGG; SEQ ID NO:9), VL142 (ACAACAGCCTGCTGGACCAGGC; 
SEQ ID NO: 10), (Poly A-oligo) VL054 (CCAGTGAGCAGAGTGACG; SEQ ID NO:11), VL055 (GAGGACTCGAGCT- 
CAAGC; SEQ ID NO:12). Mouse peripheral blood lymphocytes (PBLs) were enriched for lymphocytes by density cen- 
trifugation using lympholyte M according to the manufacturer's protocol. Human PBLs were isolated by Ficoll-paque 
density centrifugation of human leukopac samples. Total RNA was extracted from lymphocytes as described below. 

30 Reverse transcription was accomplished using primer VL053 (CCAGTGAGCAGAGTGACGAGGACTCGAGCT- 
C A AG CTTTTTTTTTTTT ; SEQ ID NO:18), 5 |mg of total RNA and Superscript RT (Gibco-BRL) according to the manu- 
facturer's protocols in 20 jul I reactions. 0.5 - 1 .0 jul I of RT-synthesized cDNAs were used per RACE procedure. 3' RACE 
was performed according to the method of Frohman, M. A. (1993) Methods Emzymol. 218:340-356. 
[0321] RNA isolation and analysis: Total RNA was derived from CCE ES cells, Swiss Webster embryos/yolk sacs 

35 and C57BI/6 peripheral blood lymphocytes and was extracted using RNAstat 60 (Tel-Test B, Friendswood TX) accom- 
panied with Phase-lock gel barrier (Eppendorf). RNA was fractionated using Northern Max system (Ambion) and blotted 
onto ZetaProbe GT (BioRad) according to the manufacturer's protocols. Multiple tissue RNA panels were purchased 
(Clontech) and used according to the manufacturer's instructions. Blots were hybridized to radiolabeled DNA fragments 
encompassing either nucleotides 984-1340 of the mGL50-2 clone (357 bp; SEQ ID NO:3), corresponding to the 3' 

^o untranslated region, while fragments corresponding to the coding sequence of mGL50 were used to detect both mGL50-1 
and mGL50-2 transcripts. Hybridizations were performed at 65 °C with Express Hyb (Clontech) overnight and subse- 
quently washed with 0.1X SSC and 1% SDS at hybridization temperatures until a suitable signal to noise ratio was 
reached. Blots were exposed to phosphoimage plates and autoradiographic film for imaging. 

[0322] Gene expression analysis: For RT-PCR analysis, first strand cDNA synthesis was performed as described 
45 above for RACE procedures, followed by duplicate 25 \x\ amplification reactions (using Advantage Taq, Clontech) with 
the primers RLEE 001 and RLEE005 for mGL50-1 and primers RLEE 001 and RLEE003 for mGL50-2. Primers GAPDH- 
F and GAPDH-R were used as positive amplification controls. The oligonucleotides GAPDH-F (TGAAGGTCGGTGT- 
GAACGGATTTGGC; SEQ ID NO:19); GAPDH-R (CATGTAGGCCATGAGGTCCACCAC (SEQ ID NO:20); RLEE001 
(CATCACTAGCATTAGCCAGGC; SEQ ID NO:13); RLEE003 (TGATGTTGTG AAGCTG AGTGC ; SEQ ID NO:14); 
50 RLEE005 (TCATGAGCATCGAGCATCG; SEQ ID NO:15); VL142 (ACAACAGCCTGCTGGACCAGGC; SEQ ID NO: 
1 0); VL1 62B (TCACGAGAGCAGAAGGAGCAGGTTCC; SEQ I D NO:1 6); and VL1 63B (GGGCCCCCCAGAACCTGCT- 
GCTTCC; SEQ ID NO:17) were designed for the PCR amplification of the extracellular domain regions of mGL50-1, 
GL50-RACE, AB014553 cDNA and AB014553-RACE clones. Mouse and human cDNA panels derived from poly A+ 
RNA encompassing lymphoid and nonlymphoid tissues (Clontech) was used as a source for PCR analysis. Cycling 
55 conditions were 5 min 95 °C denaturing step followed by 35 cycles of 1 min at 95 °C, 1 min at 60 °C, and 1 min at 72 °C. 
The reaction was terminated following a 1 0 min 72 °C extension: Cycling conditions for mGL50 and mGL50-2 PCR were 
95 °C for 1 min, 60 °C for 1 min, and 72 °C for 2 min. for 33 cycles, while GAPDH PCR was performed using 30 cycles. 
[0323] For Northern blot analysis, commercially prepared RNA blots (Clontech) were hybridized to radiolabeled DNA 



47 



EP 1 900 816 A2 



fragments encompassing nucleotides 1065-1588 of mGL50-1 (494 bp; SEQ ID NO:1), or nucleotides 984-1340 of the 
mGL50-2 clone (357 bp; SEQ ID NO:3). 

[0324] Flow cytometry: COS cells were transfected with mGL50-1 or DAP-1 2 cDNA in pcDNA3.1 -CTGFP expression 
vectors. Transfection was accomplished using lipofectamine transfection reagent (Life Technologies) according to man- 

5 ufacturer's protocols. Cells were harvested 3 days after transfection. 1 0% Rabbit serum was used to block non-specific 
binding to cells. Cells were stained at room temperature for 20 minutes with 200 ng of fusion proteins in 1 00 julI of PBS 
2% FCS. Cells were washed and secondary staining performed with PE-linked goat anti-mouse IgG. Cells were stained 
with propidium iodide immediately prior to flow cytometry. Positive COS transfection control was performed with hCTLA4 
cDNA followed by identification of positively staining cells with PE-linked anti CTLA4. 

10 [0325] Cell suspensions for cytometric analysis were isolated from Balb/c splenocytes (-3 months old) and washed 
once with DM EM, 1 0% (vol/vol) heat-inactivated fetal calf serum (JR Bioscience), 2 mM L-glutamine, 1 00 U/ml penicillin, 
100 img/ml streptomycin (Irvine Scientific, Santa Ana, CA), 20 |ulM 2-betamercaptoethanol (Sigma Co., St. Louis, MO), 
MEM sodium pyruvate, and MEM Non-essential amino acids (Life Technologies, Rockville, MD). Red blood cells were 
lysed with ACT lysing buffer and washed once. Splenocytes (-1 X 1 0 7 cells/ml/well) from Balb/c mice were cultured with 

15 25 img/ml LPS (Sigma) or 10 ng/ml PMA, 1 jjig/ml ionomycin. Cells were stained with FITC-labeled antibodies (BD- 
Pharmingen) and mlCOS-mlgG2am reagent, followed by flow cytometric analysis using the FACalibur and CellQuest 
software package (BD). Cell separation was performed using anti-FITC microbead magnetic selection (Miltenyi Biotec) 
followed by flow cytometric determination of T-cell enrichment. 

[0326] Ig fusion proteins: Fusion proteins of lgG2a with mICOS, hICOS, mGL50-1 , and hGL50 were constructed for 
20 use in the following examples. The notation lgG2am indicates that the lgG2a domain was mutated to reduce effector 
function (as in Steurer, W. et al. (1 995) J. Immunol. 1 55:1 1 65-74). The nucleotide and amino acid sequences of hlCOS- 
mlgG2am are presented in Figure 26 and set forth as SEQ ID NOs:23 and 24, respectively. The nucleotide and amino 
acid sequences of mlCOS-mlgG2am are presented in Figure 27 and set forth as SEQ ID NOs:25 and 26, respectively. 
The nucleotide and amino acid sequences of hGL50-mlgG2am are presented in Figure 28 and set forth as SEQ ID NOs: 
25 27 and 28, respectively. The nucleotide and amino acid sequences of mGL150-mlgG2am are presented in Figure 29 
and set forth as SEQ ID NOs:29 and 30, respectively. 

Example 1. Isolation of mGL50-1 Molecules. 

30 [0327] cDNAs encoding secreted proteins derived from RNA oflL-12 treated mouse lymph nodes were placed under 
genetic selection for signal sequences by using the Saccharomyces cerevisiae signal sequence trap method (Jacobs 
et al). Of a total of 333 cDNA:invertase clones isolated and sequenced, 1 partial cDNA clone with limited sequence 
identity with B7-1 was identified and termed mGL50-1 (Figure 1 , SEQ ID NO:1 ). RecA mediated full length cDNA isolation 
from a mouse fetal thymus cDNA library resulted in the generation of 4 additional cDNA clones that contained 3' un- 

35 translated regions as well as overlapping the partial signal trapped sequence clone. 

[0328] The consensus 271 8 nucleotide mGL50-1 sequence encoded a 322 amino acid protein with a predicted mass 
of 36 kDa. Hydropathy plot of the open reading frame predicted a structure corresponding to a leader sequence (from 
about amino acids 1-46 of SEQ ID NO:2; encoded by about nucleotides 67 to 195 of SEQ ID NO:1), an extracellular 
domain (from about amino acids 47-279 of SEQ ID NO:2; encoded by about nucleotides 196 to 904 of SEQ ID NO:1), 

^o a hydrophobic transmembrane region (from about amino acids 280-298 of SEQ ID NO:2; encoded by about nucleotides 
905 to 961 of SEQ ID NO:1) and a potential intracellular cytoplasmic domain (from about amino acids 299-322 of SEQ 
ID NO:2; encoded by about nucleotides 962 to 1032 of SEQ ID NO:1). Signal peptide cleavage was predicted at position 
46 in the amino acid sequence. Analysis of mGL50-1 by Pfam protein motif prediction program suggested structural 
similarity to Ig-domain in the cytoplasmic domain of the protein. In keeping with an Ig-like structure, 4 cysteines were 

45 found in the extracellular domain, allowing for the possibility of intramolecular bonding and distinct structural conformation 
corresponding to an IgV-like domain and an IgC-like domain, based on domain delineation. FastX sequence comparison 
in which translated proteins are searched through GenBank database yielded a number of identified cDNA clones with 
sequence similarities including AB014553, B7-1, B7-2, and Y08823. Corresponding domains in polypeptides in the B7 
family are shown in Figure 12. 

50 

Example 2. Isolation of an alternatively spliced form of GL50 

[0329] To determine the extent of transcript heterogeneity, 3' RACE was performed to isolate splice variants of murine 
GL50-1. Using specific, nested 5' oligonucleotide primers corresponding to sequences upstream and including the 
55 initiation start site of mGL50-1 , amplified PCR products were generated from cDNAs derived from mouse PBLs. Upon 
hybridization to radiolabeled oligonucleotides internal to mGL50-1 coding region, clear hybridization signals were de- 
tected. Subsequent cloning of positively hybridizing PCR products followed by sequence analysis revealed RACE se- 
quences of which none were identical to the consensus mGL50-1 sequence derived from the mouse fetal thymus library. 
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Two sets of PCR products, represented by multiple clones with extensive polyadenylation of differing lengths, were 
found to encode an alternatively spliced form of GL50. One representative product, a 1 759 bp product, termed mGL50-2, 
encoded a polypeptide 347 amino acid residues in length with a predicted molecular mass of 39 kDa (Figure 2, Figure 15). 
[0330] An alignment of mGL50-1 and mGL50-2 is presented in Figure 3. Alignment of the mGL50-1 and mGL50-2 

5 sequences demonstrated complete identity from nucleotide 67 (initiation methionine/mGL50-2 RACE priming site) to 
nucleotide 1027 of the cDNA, with the exception of two nucleotides found in multiple mGL50-2 products (nucleotides 
531 and 71 0, leading to an arginine to histidine residue at 237 of the predicted amino acid sequence (Figure 3)). These 
two nucleotide discrepancies are most probably due to strain differences between the mice used for the RNA starting 
material, since multiple separate PCR products encoded identical mismatches. Sequences downstream of position 1 027 

10 of mGL50-1 and position 961 of mGL50-2 were divergent between the two molecules (Figure 3). Both mGL50-1 and 
mGL50-2 sequences contained a consensus AATAAA polyadenylation signal upstream from the poly-A tail (13 bp for 
mGL50-2,16 bp for mGL50-1). As a result of the alternative 3' sequences encoding the carboxy terminus, mGL50-2 
lacked the final 2 amino acids of mGL50-1 but incorporated an additional 27 novel amino acids in the cytoplasmic domain. 
The predicted amino acid sequence of mGL50-2 indicated the presence of three unique tyrosine residues, Y325, Y328, 

15 and Y333, in the carboxy terminus, in addition to the tyrosine residues Y299 and Y307 shared by both the mGL50-1 
and mGL50-2 molecules. GenBank database search revealed no cDNA sequences with similarity to the divergent coding 
3' domain of the anGL50-2 product, with the exception of a complex repetitive sequence (bases 1349-1554) also found 
in numerous genomic sequences (e.g. Accession numbers AC00581 8, AC006508, and AF1 1551 7), as well as in known 
mRNAs (mouse desmin: Z18892; and mouse servivin: AF115517). No such untranslated repetitive sequences were 

20 found in mGL50-1. 

Example 3. Identification of a human ortholoq of GL50 

[0331] After the murine GL50 clones were identified, database search and subsequent comparisons suggested that 
25 mouse mGL50-1 and mGL50-2 clones may have homology with a cDNA isolated from human brain, KIAA clone 0653 
(accession # AB014553; Ishikawa et al. (1998) DNA Res. 5:169). AB014553 has been described as a 4.3 kb cDNA 
localized on chromosome 21, encoding a putative 558 amino acid protein with a molecular mass of 60 kDa. Because 
both the length of the AB014553 cDNA and the encoded protein were nearly 2 fold greater than mGL50-1, it was not 
likely that AB014553 was a human ortholog ofthe mouse GL50 sequences. However, analysis of the first 303 residues 
30 of the deduced AB014553 protein sequence indicated similarity with mGL50-1 , excluding the signal peptide region of 
the cDNA. 

[0332] Because AB01 4553 was derived by size fractionation of large cDNAs, AB01 4553 was believed to represent a 
variant transcript that also existed as a smaller gene product. To address whether such a smaller product existed, 3' 
RACE analysis of human PBLs with oligonucleotides primers (VL 142 (ACAACAGCCTGCTGGACCAGGC; SEQ ID NO: 

35 10) and VL141 (CGTGTACTGGATCAATAAG ACGG ; SEQ ID NO:9)) corresponding to extracellular domains of 
AB01 4553 with sequence homology with GL50 were performed. Four RACE products were isolated which encoded an 
open reading frame identical to AB014553 from amino acid residue 24 (starting point of RACE primer) to residue 123 
(Figure 6). From residue 123 onward, the AB014553 RACE product diverged from the cDNA sequence resulting in an 
alternative 88 nucleotides with a 3' coding region encoding 9 amino acids, termination codon, and a short untranslated 

^o domain. This alternative 3' region resulted in a premature termination codon in the AB01 4553 RACE clone as compared 
to AB014553 cDNA (Figure 7). The predicted total length of the deduced polypeptide encoded by this alternatively 
transcribed product, after merging with shared 5' sequences of AB014553 cDNA was 309 amino acids, consistent with 
a human protein orthologous to mouse GL50 protein sequences, referred to as hGL50 (Figure 8). 

45 Example 4. Alignment with chicken B7-1 

[0333] Upon alignment with a previously characterized chicken B7-1 (Accession No. Y08823), a pattern of conserved 
cytoplasmic domain sequences emerged between these molecules. Within the intracellular region, hGL50 protein se- 
quences exhibited 34% identity (9/26 residues aligned) with mGL50-1, while chicken Y08823 exhibited 57% identity 

50 (8/14 residues aligned) with either human or mouse GL50 or GL50-2 resulting in a consensus motif of (R)(R)(R)[XX](Q) 
(H)(X/-)SY(T)(G)(P) (SEQ ID NO:21), wherein amino acids in brackets are variable between the three genes, amino 
acids in parentheses are shared between two of the three genes, and amino acids without brackets or parentheses are 
shared by all three genes. A FastA database search for proteins with homology to this motif yielded two mouse entries, 
Veli-2 (Accession No. AF087694) and MALS-2, a C. elegans LIN-7 homolog (Accession No. AF1 73082), encoding 

55 identical sequences with the motif RRRQQHHSYT (SEQ ID NO:22). This unique domain is localized at the carboxy 
terminus of Veli-2 but is not present in the isoforms Veli-1 or Veli-3, and extends beyond the area of homology with C, 
elegans LIN-7. 
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Example 5. Expression of GL50 molecules 

[0334] mGL50-1 and mGL50-2 specific RT-PCR reactions on commercial cDNA panels resulted in abundant PCR 
products generated in heart, spleen, lung, liver, skeletal muscle, kidney, testis, 7-15 day embryo and PBL. Negligible 
5 product was detected with brain samples for either transcripts while low levels of product was detected in testis samples 
for mGL50-2 (Figure 4). By Northern blot analysis of commercial RNA blots using probes specific to either the shared 
extracellular domain of mGL50-1 and mGL50-2 or to the 3' untranslated regions of either mGL50-2 or mGL50-1 , differential 
hybridization was found between the two molecules. Whereas both the extracellular domain probe and the mGL50-1 
specific probe hybridized to an -2.7 kb message clearly detectable in heart, brain, spleen, lung, liver, skeletal muscle, 
10 kidney and testis samples (identical to the pattern previously seen in blots specific for mGL50-1 (Ling et al. (2000) J. 
Immunol. 164:1653-7), the mGL50-2 specific probe hybridized to a 1.7 kb transcript detected only in heart, spleen and 
kidney samples, suggesting that mGL50-2 transcripts were concurrently transcribed as a limited subset of tissues with 
the highest expression mGL50-1 (Figure 5). In poly A+ RNA blots, hybridization using the mGL50-2 3' UTR specific 
probe was clearly detected in samples representing undifferentiated ES cells, day 10 embryoid bodies, day 12.5 em- 
's bryonic yolk sac, and day 15 fetal liver. In contrast, hybridization using the mGL50-1 cDNA coding sequence probe 
clearly revealed transcript in all samples examined 

[0335] To assess the tissue distribution of AB014553 cDNA and AB014553 RACE clones, RT-PCR/southern blot 
analyses were performed under similar conditions as for the GL50 sequences described above. Using oligonucleotides 
primers specif for the amplification of published AB014553 cDNA (VL142 (ACAACAGCCTGCTGGACCAGGC; SEQ ID 

20 NO:1 0) and VL1 63B (GGGCCCCCCAGAACCTGCTGCTTCC; SEQ ID NO: 1 7)), PCR resulted in the complete absence 
of any detectable AB014553 cDNA signal for all samples tested (FigurelO). Possible explanations for the lack of RT- 
PCR products representing published AB014553 cDNA sequences may be the use of non-optimized oligonucleotides, 
extremely low abundance of the target transcript, or actual absence of this form of the product. RT-PCR conditions 
specific for AB014553 RACE using oligonucleotide primers VL142 and VL162B resulted in the detection of a 350 bp 

25 amplification product in kidney, lung, ovary, fetal liver, and leukocyte, with the highest level of amplified product detected 
in fetal liver. Surprisingly, virtually no signal was detected in spleen, lung, thymus, or lymph nodes. These results are 
consistent with the published report of ABOI4553 transcript distribution (Ishikawa et al. (1998) DNA Res. 5:169) in a 
smaller survey of a tissue cDNA panel, but does not complement the tissue distribution patterns observed for the GL50 
molecules. 

30 [0336] Unlike the mGL50-1 and mGL50-2 clones in which lengthy and divergent 3' untranslated regions were present, 
AB014553 RACE products contained only 88 bp of sequence that diverged from that of AB014553 cDNA. Because of 
this, it was not possible to design-nucleotide probes of sufficient specific activity for the detection of the RACE product. 
Using coding region probe for hGL50 northern hybridizations were performed on commercial human multiple tissue RNA 
blots to assess transcript distribution (Figure 1 1 ). Results indicated the presence of a number of transcripts found in all 

35 tissues with approximate molecular size of 2.4 kg, 3.0kb, 7.0 kb, with highest levels of signal present in brain, heart, 
kidney, and liver samples. Low hybridization signals were detected in colon and thymus. An additional transcript of 8.5 
kb was detected in a subset of the panel, including thymus, spleen, kidney, liver, lung and PBL while a 3.8 kb transcript 
was detected in lung and PBL sample. A unique 1 .1 kb transcript was detected only in PBL samples and corresponded 
to the predicted size of hGL50 if 5' and 3' untranslated sequences were included. Determination of other minor transcripts 

40 was difficult due to the limits of the sensitivity range of the blot. None of the obvious transcripts correlate with the 4.3 kb 
published ABOi4553 cDNA, suggesting that this sequence may not exist in nature or may be expressed at levels lower 
than detectable limits. Comparison between hGL50 blots and hGL50 RT-PCR surveys share the common feature of 
having the greatest signal in kidney tissues and less signal in lymphoid related tissues such as thymus, spleen and PBL. 

45 Example 6. Relationship of the GL50 Polypeptides to Other Polypeptides 

[0337] To determine the extent of relatedness between mGL50-1, hGL50, and human and mouse B7-1 and B7-2, 
protein sequence alignments were performed. From Pileup analysis (Figure 12), 18 amino acid locations aligned iden- 
tically between all six molecules within the extracellular domain. Of the 32 positions that define the predicted IgV-like 

50 and IgC-like folds of the B7 molecule, 13 are identically conserved between all six molecules, most notably the 4 cysteines 
that allow intramolecular folding of domains. Other areas of significant sequence conservation were also seen in the 
extracellular domain, but interestingly the identities ofhGL50/mGL50 sequences in certain locations aligned more closely 
with either B7-1 or B7-2 (identity score of 8). For example, valine residue corresponding to position 77 of mGL50-1 is 
shared by hGL50, and murine and human B7-2 sequences, but not B7-1 . Likewise, the tyrosine at position 78 of mGL50-1 

55 is conserved at corresponding locations in hGL50 and murine and human B7-1 , but not B7-2. Of the 1 6 positions with 
identity scores of 8, 5 positions are shared by mGL50-1/hGL50 and B7-1, 4 positions are shared between mGL50-1, 
hGL50 and B7-2, and 6 positions are shared between B7-1 and B7-2. 

[0338] Based on the peptide structure, these results suggest that the mGL50/hGL50sequences occupies a phyloge- 
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netic space parallel to the B7 family of proteins. Molecular phylogeny analysis (GrowTree) measuring genetic distance 
in terms of substitutions per 100 amino acids resulted in a dendrogram (Figure 13) with independent clustering of 
mGL50/hGL50 (85), m/hB7-2(68) and m/hB7-1 (88). As an outgroup, mmu67065_1 (mouse butyrophilin) was used. The 
chicken clone Y08823 also was found to be more aligned with the GL50/AB014553 sequences (~140) than the B7 
5 sequences (215-320), indicating that these sequences comprised a distinct subfamily of proteins. Distances between 
the GL50/AB01 4553, B7-2 and B7-1 branches were high (21 6-284), suggesting that large numbers of substitutions have 
occurred between these molecules since the inception of the human and rodent lineage. 

[0339] Mouse and human CTLA4 (see e.g., Dariavach, P. et al. (1 988) Eur. J. Immunol. 1 8:1 901 ; GenBank Accession 
Number L15006; U.S. patent 5,434,131) and ICOS (Hutloff et al. (1999) Nature 397:263; WO 98/38216) were also 
10 analyzed for phylogenetic relationships using the same parameters. Genetic distances revealed a pattern that was 
distinct to that seen for the B7-like proteins. As indicated in previous reports, the genetic distance between the mouse 
and human ICOS and CD28 (1 76-2570) was closer than that of CTLA4 (261 -405). By comparison, the genetic distance 
between CD28 and CTLA4was much smaller (143-1670), indicating that the structural relationships between the mem- 
bers of the receptor family were not parallel to that of the ligand family. 

15 

Example 7. Demonstration of binding of GL50 to ICOS 

[0340] To determine whether GL50 was a ligand for murine CTLA4, CD28 or ICOS, transfection binding studies were 
performed with mGL50-1 expression vectors (Figure 14). mGL50-1 or human DAP-12 negative control cDNA were 

20 transfected into COS cells followed by staining with either ICOS-lg, CD28-lg or CTLA-4-lg fusion proteins or normal 
murine Ig. COS cells were stained two days after transfection with 5 img/ml of fusion protein, followed by goat anti-mouse 
PE labeled antibody. By flow cytometry, binding of GL50 transfected COS cells was detected only by the ICOS-lg reagent 
(15%) , while negligible binding was detected for CD28-lg, CTLA4-lg or the normal mouse Ig used as a negative staining 
reagent. No binding of any fusion protein was detected for the DAP-12 cDNA transfectants. These results suggest that 

25 GL50 is a ligand for ICOS-lg. 

[0341 ] Although not found under the specific binding conditions herein, it may be that GL50 is also capable of signaling 
through either CD28 or CTLA-4 given the published data showing the weaker binding activity of the B7 molecules to 
CD28 than CTLA-4 (Greenfield, E. A. et al. (1998) Crit. Rev. Immunol. 18:389) in cell based assays. 

30 Example 8. mGL50-2 transcripts encode functional cell surface proteins 

[0342] To demonstrate that mGL50-2 transcripts encode functional cell surface proteins, vectors expressing the mGL50 
coding regions under the transcriptional control of EF-1 alpha promoter were used to transfect COS cells. By flow 
cytometry, both mlCOS-mlgG2am and hlCOS-mlgG2am were found to bind mGL50-1 and mGL50-2 transfected cells 

35 (9-14%) while negligible binding was observed with mCTLA4-mlgG2am (<1%), indicating that the domains encoded by 
the additional residues in the alternate carboxy-tail found in mGL50-2 do not affect surface mobilization of this protein 
(Figure 17). It is also notable that hlCOS-mlgG2am binds both molecules, suggesting that the ICOS receptors, like 
CTLA4 and CD28 receptors, retain ligand binding capacity when assayed against targets across primate/rodent species 
boundaries. Other mouse cells lines were examined for the presence of surface ICOS-ligand. WEHI231 cells have been 

^o previously shown to have surface expression of both B7-1 and B7-2, whereas ES cells have been shown to display only 
B7-1. mCTLA4-mlgG2am staining of WEHI 231 cells was clearly detectable using 8 ng/ml of reagent, while mlCOS- 
mlgG2am staining was minimally detectable at levels starting at 1 img/ml. These results suggest that the binding affinity 
of mCTLA4-mlgG2am reagent to the B7 molecules is at least 100 fold greater than mlCOS-mlgG2am reagent binding 
to GL50 on WEHI cells, similar to the low binding affinity measured between CD28-lg and B7 proteins. In the presence 

45 of blocking antibodies, mCTLA4-mlgG2am binding to WEHI 231 was totally abrogated, while no effect on mlCOS- 
mlgG2am binding to cells was observed, confirming that neither WEHI 231 B7-1 nor B7-2 potentiates specific binding 
with mlCOS-mlgG2am (Figure 18). To corroborate evidence from RNA blot analysis demonstrating the presence of 
GL50 in cells representative of the very early embryonic environment (see above), undifferentiated CCE ES cells were 
analyzed by direct staining with antibodies to B7-1 and indirect staining with mlCOS-mlgG2am fusion protein. Undiffer- 

50 entiated ES cells stained with anti-B7-1 (Figure 19) revealed a one-log fluorescence shift over background, consistent 
with previous observations (Ling, V. et al. (1998) Exp. Cell. Res. 241:55-65), and a half-log fluorescence shift over 
background with mlCOS-mlgG2am staining, demonstrating the simultaneous surface display of both B7 and GL50 type 
molecules in a system that reflects the undifferentiated inner cell mass of early preimplantation embryos. 

55 Example 9. Expression of GL50 on splenocyte subpopulations 

[0343] Phenotypic analysis of the major splenic cell types exhibiting GL50 surface proteins revealed mICOS-mlg 
binding to be most readily detectable on phenotypic CD1 9+ B cells, although it was apparent that other splenic cell types 
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exhibited ICOS-lg staining (see Figure 30). To further identify other freshly isolated cells that display GL50, wild type 
Balb/C splenocytes were compared to RAG1 -/- splenocytes lacking mature B and T cells. The results are presented in 
Figure 20 and Table 3. 



Table 3 




Balb/C 






RAG1 -/- 




Antibody stain 


n= 


% of Total 


% ICOS-lg 


n= 


% of Total 


% ICOS-lg 






Splenocytes 


positive 




Splenocytes 


positive 


anti-CD3 


10,000 


30% 


10% 


50,000 


<1% 


- 


anti-CD4 


10,000 


25% 


8% 


10,000 


11% 


45% 


anti-CD8a 


10,000 


9% 


10% 


50,000 


<1% 


- 


anti-CD19 


10,000 


65% 


97% 


50,000 


<1% 


- 


anti-CD24 


10,000 


64% 


94% 


10,000 


67% 


28% 


anti- 


10,000 


61% 


97% 


50,000 


6% 


5% 


CD45R/B220 














anti-CD11B 


50,000 


8% 


26% 


10,000 


37% 


31% 


anti-CD11C 


50,000 


2% 


43% 


10,000 


20% 


55% 


anti-pan NK 


50,000 


3% 


20% 


10,000 


9% 


3% 


anti class II 


10,000 


65% 


95% 


10,000 


27% 


3% 


anti CD40 


10,000 


61% 


97% 


10,000 


<1% 




anti CD69 


10,000 


2% 


25% 


50,000 


3% 


5% 



[0344] As expected, Balb/C splenocytes revealed high levels of mlCOS-mlgG2am binding (Figures 20A and B) to 
phenotypic B cells (CD19, B220, CD40 >94%), while lower levels were found on phenotypic T cells and T cell subsets 
(CD3+, CD4+, and CD8+; <10 %), macrophages (CD11b, 26%), dendritic cells (CD11c, 43%) and NK-cells (pan-NK, 
20%). mlCOS-mlgG2am binding was also detected on the more general lymphoid markers CD24 and class II (94%) 
cells. Northern blot analysis (using an mGL50-1 specific probe) demonstrated that GL50 transcripts are expressed in 
the splenocytes of RAG 1 -/- mice. This suggested that in the absence of mature T or B cells, GL50 was still expressed 
on other splenocyte subpopulations. Consistent with these observations, analysis of RAG1 -/- splenocytes (Figure 20B) 
demonstrated that they are CD3-, CD8-, CD19-, and CD40-, and that the remaining CD1 1 b+ (35%) and CD1 1c+ (55%) 
cells are readily counterstained with mlCOS-mlgG2am. Low levels (<5%) of ICOS-lg staining were also apparent in 
B220+, panNK+, and CD69+ cells. It is not currently understood why there is a disparity in mICOS-mlg staining levels 
between these three markers on RAG1 -/- splenocytes, when compared with the higher levels detected in Balb/c splen- 
ocytes. mlCOS-mlgG staining of CD4+ (45%) and CD24+ (28%) cells was also apparent in RAG1 -/- splenocytes despite 
the absence of staining for other T cell markers. CD4+ staining has previously been reported on dendritic cells (Aicher, 
A. et al. (2000) J. Immunol. 1 64:4689-96), and this was supported by the presence of a CD4+, CD1 1 c+ double positive 
cell population in these mice (Figure 20C). The presence of GL50 transcripts in conjunction with mlCOS-mlgG binding 
of phenotypic macrophage and dendritic cell subsets in RAG1 -/- splenocytes verifies the presence of ICOS-ligand on 
professional antigen presenting cells that may potentiate signaling through ICOS in vivo. 

Example 1 0. Expression of GL50 splice variant mRNAs in splenocyte subpopulations and embryonic cells 

[0345] Because ICOS-ligand appeared to exist as at least two splice variants, experiments were performed to semi- 
quantitatively assess the presence GL50-1 and GL50-2 transcripts in splenocyte cell populations. Balb/C splenocytes 
cultured in the presence of LPS or ConA were found to upregulate ICOS-ligand in all splenocytes examined (Figure 20). 
To determine if preferential stimulation of these cells caused differential upregulation of GL50-1 or GL50-2 transcripts, 
GL50-1 and GL50-2 transcripts were detected by RT-PCR using transcript specific oligonucleotide primers and hybrid- 
ization probe sets. The results are presented in Table 4. 
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Table 4. RT-PCR Analysis of 
mGL50 Isoforms 
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LPS 






ConA 






5 




w.t. 






















mGL50 


mGL50 
-B 


GAPDH 


mGL50 


mGL50 -B 


GAPDH 


mGL50 


mGL50 -B 
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Spleen 


+ 


+ 


(+) 


+ 


+ 


(+) 


+ 


+/- 


(+) 


10 


CD4 


+ 


+ 


(+) 


+ 


+/- 


(+) 


+ 


+/- 


(+) 




CD8 






(+) 


+/- 




(+) 


+ 


+/- 


(+) 




CD19 


++ 


++ 


(+) 


++ 


++ 


(+) 


+ 




(+) 




RAG-1 


+ 


+/- 


(+) 














15 


-/- 
Spleen 
RAG- 
1-/- 


+ 


+ 


(+) 














20 


CD1 1 c 
RAG- 

1-/- 
CD11b 


+ 


+/- 


(+) 














25 


F5M 
F5M 
LPS 
WEHI 
231 


+ 
+ 

+ 


+/- 
+/- 


(+) 
(+) 

(+) 














30 


DO ES 
cells 
D11.5 


++ 
+ 


++ 
+ 


(+) 
(+) 














35 


Embryo 




















D12.5 
Embryo 
D11.5 


++ 
++ 


+ 
+ 


(+) 
(+) 















Yolk 



Water 
Control 

Amplifications were performed in duplicate followed by autoradiographic detection of blotted GL50 samples. 
45 - represents the absence of signal in duplicate samples. 

+/- represents the presence of signal in one of the duplicate samples. 

+ represents the presence of signal within both members of duplicate samples. 

++ represents autoradiographic saturation of signal within duplicate samples. 

50 (+) represents visual detection of amplified GAPDH products by ethidium bromide staining 



[0346] Balb/C CD4+, CD8+ and CD1 9+ cell subsets and RAG1 -/- CD1 1 b+ and CD1 1 c+ cell subsets were enriched 
to >90% purity by bead separation. Duplicate RT-PCR analyses of quantity- normalized RNA samples revealed GL50-1 
and GL50-2 transcripts to be present in non-treated CD4+ T-cells and CD19+ B cells, consistent with results from flow 
cytometric analysis. However, neither GL50-1 nor GL50-2 transcripts were amplified in CD8+ T cells, despite surface 
protein detection by FACS and enrichment of IGOS-ligand positive cells. It is possible that CD8 GL50 expression is 
below the threshold of detectability by RT-PCR, or that CD8+ ICOS-ligand is yet another variant of GL50 not targeted 
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for detection by this assay. Also, one cannot rule out the possibility that the form of ICOS ligand appearing on CD8+ 
cells may not be GL50-1 or GL50-2, as described herein, or that the CD8+ ICOS ligand may originate elsewhere as a 
soluble protein and become transferred to this cell type. LPS activation led to a profile similar to that seen for control 
cells, with the exception that low levels of GL50-1 were detected in CD8+ samples, suggesting that LPS stimulation of 
B cells may indirectly upregulate expression of this form of ICOS-ligand on T cells. ConA stimulation of splenocytes 
resulted in the amplification of GL50-1 transcripts across all samples with a decrease of product in CD1 9+ cells. GL50-2 
transcripts were induced in CD8+ samples and were not detected in CD1 9+ samples. The decrease of amplified product 
of both GL50-1 and GL50-2 in CD1 9+ cells suggests a regulation of B cell transcription upon exposure to ConA. In RAG1 
-/- splenocytes, GL50-1 and GL50-2 were detected in CD1 1b+ and CD1 1c+ positive cells, while cultured dendritic F5M 
and WEHI231 cells exhibited GL50-1 transcripts. Low levels of GL50-2 were detected in WEHI 231 and LPS activated 
F5M cells, while no amplified product was detected in uninduced F5M cells. In samples representing embryonic tissues, 
GL50-1 and GL50-2 were detected in all samples, with abundant levels of both splice variants present on DO ES cells. 
High levels of GL50-1 were also detected in day 1 2.5 embryo and 1 1 .5 yolk sac samples. These results correlate with 
the degree of transcript hybridization shown by RNA blot analysis (see above). 

Example 1 1 . The chicken GL50-like molecule Y08823 does not bind ICOS 

[0347] Very recently, the crystal structure of B7-1 was resolved at the three angstrom level, revealing a structure 
comprised of parallel, 2-fold rotationally symmetric homodimers with charged residues in the amino-terminal domain of 
B7-1 responsible for direct interactions with CD28/CTLA4. Human and mouse GL50, B7-1 , and B7-2 protein sequences 
exhibit 19-27% sequence identity (Table 5) suggesting that they may also share structural similarities. 



Table 5. Alignment scores between GL50, B7-1 , and B7-2 related proteins 



Percent Sequence Identity 
hGL50 Y08823 mGL50 mGL50 -B hB7-2 mB7-2 hB7-1 mB7. 





hGL50 




36 


44 


44 


19 


24 


25 


22 




Y08823 
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37 


28 
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30 


Genetic 


mGL50 


85 


131 




99 


24 


25 


24 
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mGL50 
-B 


85 


131 


0.4 




26 


23 


26 


26 




hB7-2 


270 


230 


221 


221 
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26 


30 




mB7-2 


251 


310 


200 


200 


68 




24 


28 




hB7-1 


243 


224 


247 


247 


222 


243 




45 




mB7-1 


261 


223 


282 


282 


190 


182 


88 






mmu67 065 


188 


219 


214 


214 


207 


248 


220 


265 



Previous analysis of Y08823 suggested that beta strands forming the DEB and non-twisted AGFCC'C" beta sheets 
within the amino terminal domain were predicted to be conserved between Y08823 and B7-1 (Ikemizu, S. et al. (2000) 
Immunity 1 2:51 -60). Interestingly, the highest degree of predicted secondary structure conservation between the GL50 
sequences and Y08823 was also within the regions encompassing the DEB beta sheets of the corresponding amino 
terminal domain. Predictions based on these structural homologies suggest that sequence identities in this region could 
provide key interdomain electrostatic contacts and conserve hydropathicity within the interdomain core, resulting in a 
similar molecular framework shared by the GL50 and B7 molecules (Figure 1 6). Based on these observations, chicken 
Y08823 was assessed for the ability to bind ICOS receptors. Sequences representing the mature Y08823 peptide were 
obtained by RT-PCR and subcloned into an expression vector, which upon transfection of COS cells, yielded a functional 
surface protein. Y08823 transfected cells were found to bind CTLA4-lg but not to hlGOS-mlgG2am nor mlCOS-mlgG2am 
(Figure 1 7). Although it cannot be ruled out that the binding of Y08823 to ICOS occurs at levels below detection, based 
on the assay conditions used here, it is not likely that the GL50-like protein Y08823 can cross-function as a ligand for 
human or mouse ICOS receptors. 

[0348] Structural and genetic similarity suggests that B7/GL50 type proteins are conserved across extreme phyloge- 
netic boundaries, and implicit in this interpretation is that mechanistic pathways utilizing these proteins are also shared. 
The evidence that these proteins have similar functions in T cell signaling raise the question ofthe absolute number and 
the origins of costimulatory ligands, their cognate receptors, and derivative spliced variants that exist. Other proteins 
that fit into the B7 Ig-superfamily structure include MOG and butyrophilin, but these proteins have not been determined 
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to participate as ligands in any costimulatory pathway (Henry, J. et al. (1999) Immunol. Today 20:285-8). With the 
sequence availability of chromosome 21 (Hattori, M. et al. (2000) Nature 405:311-9), the genomic organization of the 
human ICOS-ligand was determined, indicating the presence of at least 2 splice variants in the form of hGL50 (Ling, V. 
et al. (2000) J. Immunol. 164:1653-7) and KIAA clone 0653 (Genbank Accession No. AB014453). Among the members 
of the 87-like genes, the genomic structure of B7-1, B7-2, butyrophilin, and hGL50 have been reported. Although the 
absolute number of exons that comprise these genes varies from 5 to 12, these genes share structure, in that distinct 
exons encode the two Ig-like extracellular domains, one exon encodes the transmembrane domain, and multiple exons 
encode the cytoplasmic domain (e.g., two exons for hGL50, two exons for B7-2 (Jellis, C. E. etal. (1995) Immunogenetics 
42:85-9; Borriello, F. et al. (1995) J. Immunol. 155:5490-7), one to two exons for B7-1 (Borriello, F. et al. (1994) J. 
Immunol. 153:5038-48), and three exons for butyrophilin (Ogg, S. L. etal. (1996) Mamm. Genome 7:900-5)). For KIAA065 
3, the splice junction between exons encoding cytoplasmic domains 1 and 2 is not used, resulting in a read-through of 
2.9 kb into the putative intron 6. Upon alignment of KIAA0653 with chromosome 21 BAC clone HS21 C098, the alternative 
3' cytoplasmic domain of KIAA0653 was not found to be in agreement: eight sequence discrepancies were found, 
comprised of 7 mismatches and one 1 7 bp deletion. In contrast, exon sequence alignment of human GL50 to HS21 C098 
revealed no sequence dissimilarities up to and including the polyadenylation site. The examples set forth above show 
that human GL50, mGL50-1, and variant mGL50-2 show some amino acid sequence identity near the splice site for 
cytoplasmic domains 1 and 2 (mGL50-1 residues 31 6-31 8: E-L-T; Figure 1 6). The shared pointof splice variation between 
hGL50/AB014553 and between mGL50-1/mGL50-2 suggests the potential of a conserved mechanism that allows or 
promotes alternative splicing of cytoplasmic domain 2, perhaps to offer alternate signaling through the combinatorial 
addition of alternate functional domains. The observation that mGL50-2 and the original mGL50-1 are transcribed with 
differing tissue specificity supports the notion that regulation of these molecules in cell signaling is dependent on phys- 
iological locale and activation state. 

[0349] The existence of a conserved intracellular motif between mammalian GL50 and avian Y08823, along with the 
presence of multiple forms of GL50 with divergent carboxyl regions, further suggests that differences in the intracellular 
domain of these molecules may lead to distinct signaling functions. This is further supported by the presence of 3 
additional tyrosine residues found in the intracellular domain of mGL50-2, in addition to the 2 shared with mGL50-1. 
This contrasts with the structure of B7-1 and B7-2, where the intracellular regions lack any obvious conserved sequences 
and have been deleted without impairment of costimulatory activity, suggesting that intracellular signaling is not a key 
feature of these B7 proteins (Brunschwig, E. B. et al. (1 995) J. Immunol. 1 55:5498-505). The conserved motif of hGL50, 
although predicted to be in the intracellular portion of the molecule by hydrophobicity analysis, was found to be encoded 
by the exon 5 transmembrane domain, and not the exon 6 cytoplasmic domain-1. In the chicken Y08823 cDNA clone, 
sequence homology terminates within three amino acid residues following the corresponding exon 6/cytoplasmicdomain- 
1 . If the genomic organization of hGL50 is maintained in Y08823, where the conserved motif is encoded by the intracellular 
portion of the exon-5 transmembrane domain, then it is possible that DNA segments orthologous to exon 6 and exon 7, 
encoding cytoplasmic domains 1 and 2 in hGL50, may be completely absent in chicken. In the structural studies of the 
B7 cytoplasmic domain, it is argued that those sequences may be completely dispensable (Brunschwig, E. B. et al. 
(1995) J. Immunol. 155:5498-505). However, the fact that alternate cytoplasmic exons are used in B7-1 and GL50 
suggests that the addition of alternate exon domains may have occurred during the time when the novel B7-like proteins 
were generated. The B7-like butyrophilin proteins are encoded by a number of splice variants, the predominant form of 
which contains a cytoplasmic domain 3 encoding a intracellular Ring finger motif which is perhaps used in transducing 
signaling from this molecule (Ogg, S. L. et al. (1996) Mamm. Genome 7:900-5). These observations support the idea 
that other ligand type molecules, such as GL50 and Y08823, with the conserved intracellular motif from exon-5 and 
other cytoplasmic domains, may have alternate roles as signal delivery and a signal receptor molecules, depending on 
the environmental millieu in which the is cell is found. 

[0350] To clearly define the cell subsets that show surface expression of GL50, comparative phenotyping of RAG 1 
-/- and Balb/C splenocyte subsets was performed. The examples set forth above show that freshly isolated CD4+ and 
CD8+ cells, as well as RAG1 -/- CD1 1c+ cells contained subpopulations of ICOS-ligand expressing cells. These results 
are distinct from previous studies where ICOS-ligand was reported to be absent in T-cell lines (Aicher, A. et al. (2000) 
J. Immunol. 1 64(9):4689-96) and some dendritic cell lines (Yoshinaga, S. K. et al. (1999) Nature 402:827-32). RT-PCR 
analysis of purified cell subsets confirmed that both GL50-1 and GL50-2 were expressed in the same cells suggesting 
that both transcripts may contribute to the surface display of ICOS binding. In addition to antigen presenting cells, it was 
demonstrated that the initial expression of costimulatory ligands occurs early in the ES cell model of embryonic devel- 
opment with the presence of B7-1 and GL50-1 transcripts in undifferentiated cells and in embryoid bodies cultured 10 
days in vitro Ling, V. et al. (1 998) Exp. Cell Res. 241 :55-65). In this study, it is further demonstrated that by RNA analysis, 
GL50-2 transcripts are found within these tissues. By day 9 of embryoid body differentiation, emergent hematopoietic 
cells phenotypically resemble yolk sac hematopoietic progenitors in vivo, as evidenced by the potential of c-kit+/PECAM- 
1+ cells to produce mixed hematopoietic progenitors and CD45+ cells to produce macrophage progenitors in colony- 
forming assays (Ling, V. and Neben, S. (1997) J. Cell Physiol. 171:104-15; Ling, V. et al. (1997) Eur. J. Immunol. 27: 
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509-1 4). These CD45+ cells were also found to be B7-1 + and B7-2+, strongly suggesting costimulatory ligand expression 
occurs very early in lymphopoiesis. Correspondingly, high levels of GL50-1 and GL50-2 expression were found in sites 
of embryonic hematopoiesis such as embryonic day yolk sac and fetal liver. It is noteworthy that ICOS-ligand is inducible 
in embryonic fibroblast cultures, a cell type derived from a time point prior to definitive lymphopoiesis, suggesting that 
the mechanism for costimulatory signaling cascade may be poised independently of the initial formation of adaptive 
immune response. It has been postulated that metazoans share common evolvable pathways that occur at the phylotypic 
stage of embyrogenesis, and that certain core physiological processes which have special properties relevant to complex 
development are reflected during this time period of embryonic development and later in adult physiology (Kirschner, 
M. and Gerhart, J. (1998) Proc. Natl. Acad. Sci. USA 95:8420-7). It remains to be determined whether co stimulatory 
ligands are part of some core processes utilized by both embryos and adult systems. 

[0351] Despite the large genetic distance between the B7 family members, the fact that primate and rodent B7-1 and 
B7-2 retain cross-binding to CTLA4 and CD28 across phylogenetic lines suggests tolerance of nucleotide replacement 
within these signaling molecules through the time course of natural history. To compare the phylogenetic divergence 
pattern between costimulatory ligands and their receptors, protein sequences of CTLA4 (Genbank Accession Nos. NM_ 
009843 and NM_005214), CD28 (Accession Nos. NM_007642, NM_006139, and X67915), and ICOS (Genbank Ac- 
cession No. AJ250559 and Genseq Accession No. V531 99) receptors from mouse, human and chicken were analyzed. 
When represented in graphical format, the genetic distance values of these receptors (Table 6) revealed a pattern (Figure 
21) in which distances between ICOS and CD28 proteins were closer than distances between ICOS and CTLA4. 



Table 6. Alignment Scores between ICOS, CTLA4 and CD28 



Percent Sequence Identity 
hICOS mICOS hCTLA4 mCTLA4 hCD28 mCD28 chCD28 



hICOS 




69 


21 


20 


28 


24 


21 


mICOS 


41 




17 


16 


25 


21 


20 


Genetic Distance hCTLA4 


250 


368 




74 


30 


29 


32 


mCTLA4 


272 


466 


33 




31 


32 


31 


hCD28 


175 


205 


165 


154 




67 


50 


mCD28 


217 


257 


167 


149 


44 




48 


chCD28 


246 


278 


152 


156 


79 


85 





[0352] When comparing receptor sequence relationships between species, distance values for human CD28/ICOS 
(1 76) were smaller than those for mouse CD28/ICOS (257). Likewise, human CTLA4/ICOS distance values (261 ) were 
also found to be smaller than mouse CTLA4/ICOS distances (405). These data suggest that structure of ICOS molecule 
is more likely derived from the form of CD28 rather than CTLA4. In contrast, phylogenetic analysis of the costimulatory 
ligands demonstrated that the distance values between GL50 and B7-1 (243-282) were nearly equivalent to the distance 
values between GL50 and B7-2 (200-270). Y08823 was found to exhibit higher sequence identity and lower genetic 
distance (36-37%; 131-138) to mouse and human GL50 proteins than to B7 proteins (23-30%, 230-310). The near- 
equivalent genetic distances between the GL50 and the B7-1/B7-2 family members and the non-equivalent genetic 
distance between the ICOS and the CD28/CTLA4 family members implies that the evolutionary/functional constraints 
guiding the receptor family is different from those guiding the ligand family. 

[0353] Phylogenetic sequence relationships may reflect genomic placement of these molecules: B7-1 and B7-2 co- 
localize to mouse chromosome 16 and human chromosome 3, while CTLA4, CD28, and ICOS co-localize to mouse 
chromosome 1 and to human chromosome 2q33. In contrast, the GL50 genetic loci are not linked to the B7 loci; human 
GL50 is located at chromosome 21q22 (Hattori, M. et al. (2000) Nature 405:311-9) while mouse GL50 is located on 
chromosome 1 0. By TFastX analysis, no additional GL50-like homologs were identified in chromosome 21 , suggesting 
that GL50 may not exist as a family of genes clustered together like B7-1 and B7-2. With respect to Y08823, it is not 
clear whether this molecule is a true ortholog of B7-1 or whether Y08823 represents a novel B7-like molecule whose 
ortholog has not been defined in mammalian systems. However, from the 23-30% sequence identity shared between 
B7s and Y08823, including multiple amino acid replacements at charged residue sites, it was surprising that these 
proteins retain functional crossbinding to CTLA4 (O'Regan, M. N. et al. (1999) Immunogenetics 49:68-71). The unex- 
pected result of Y08823 bearing stronger structural resemblance to GL50, yet retaining binding properties characteristic 
of B7-1 and B7-2, suggests that structural and functional constraints to the divergence of these costimulatory ligands 
are low. 

[0354] Numerous scenarios may account for the differing genetic distances measured between receptor families and 
ligand families. It is possible that the genes encoding the GL50/B7 family of proteins emerged earlier than genes encoding 
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the CD28/CTLA4 receptors. The formation of genes encoding the ICOS receptor may have arisen later during phylogeny 
and may be based on the structure of CD28, thus resulting in a greater similarity to CD28 than CTLA4 molecules. This 
hypothesis may account for the numerous B7-like proteins that exist, while relatively few CD28-like receptors have been 
described. It is notable that certain exons of CTLA4 retain remarkable sequence constraint, even at the level of synon- 

5 ymous DNA mutations, suggesting the presence of a yet-to-be-defined mechanism that protects that locus from random 
mutations (Ling, V. et al. (1999) Genomics 60:341-355). It may be that a mutation constraining mechanism regulates 
the costimulatory receptor region over the length of the CTLA4/CD28/ICOS loci, or that the added selection pressure 
upon the intracellular signaling domain of these receptors is sufficient to maintain a lower rate of divergence. 
[0355] Costimulatory ligands and receptors belong to the Ig-superfamily of proteins, which have been defined as those 

10 proteins that share homology to immunoglobulins at the 10-20% range, with characteristic intrachain disulfide bonds. 
Ig-superfamily proteins are widely distributed among proteins of different functions and between vertebrate phylogenies. 
The appearance of arthropods and chordates dates back 600 million years, and it has been suggested that molecules 
representing the putative progenitors of the Ig-superfamily are even more ancient, probably being present in the acoe- 
lomates such as flatworms and nematodes. The notion that the Ig-superfamily of proteins is at least as ancient is 

15 supported by the finding that some Ig-like proteins such as N-CAM are found in mammals as well as insects. The 
immunological "big bang" event (Marchalonis, J. J. et al. (1998) Immunol. Rev. 166:103-22, and references therein) 
which gave rise to the Ig-based, combinatorial adaptive immune system theoretically appeared during the emergence 
of jawed fish 450 million years ago over a geologically brief time span of 1 0-20 million years. Currently, no mechanism 
by which the immunoglobulin system may have emerged from the Ig-superfamily of molecules has been clearly defined. 

20 However, theories have been proposed that suggest that genes encoding Ig-domains and recombinase enzymes nec- 
essary for the combinatorial immune system were horizontally transferred on a sufficiently large enough scale to offer 
a selective advantage (Bernstein, R. M. etal. (1996) Proc. Natl. Acad. Sci. USA 93:9454-9). Notably absent is a foundation 
for a comprehensive biochemical framework incorporating the salient signaling features of Ig-superfamily costimulatory 
molecules which serve to trigger cell activation, promote immunoglobulin molecule maturation, and influence class 

25 switching. Although it is not currently known whether extant members of the ancient chondricthyes lineage such as 
sharks have costimulatory molecules, the fact that costimulation related proteins such as CD28 and Y08823 are present 
in chickens suggests that some type of costimulatory pathway was present in members of the avian lineage, which 
emerged at least 300 million years ago (Burt, D. W. et al. (1999) Nature 402:411-3), opening the possibility that the 
Y08823 molecule represents a contemporary cousin to both the GL50 and B7 molecules with a stronger resemblance 

30 to a prototypic costimulatory ligand, rather than being a true ortholog of either GL50 or B7. In contrast to the avian 
lineage, it is postulated that the mouse and human lineages separated approximately 100 million years ago, with the 
mouse genome undergoing extensive chromosomal rearrangements (Burt, D. W. et al. (1 999) Nature 402:41 1 -3) com- 
pared to those seen in chickens and humans. It is not known whether these rearrangements may have led to the 
chromosomal separation between the B7 family members and the genes encoding GL50 molecules. It is also not known 

35 if avian ICOS or variants thereof exist. 

Example 12. Soluble GL50 can costimulate human T cells 

[0356] The ability of soluble hGL50-mlgG2am to costimulate human T cells was determined using a T cell costimulation 
^o assay. Naive CD4+ T cells were purified and plated at 1 0% cells per well. Cells were stimulated with anti-CD3 on beads, 
using one bead per cell and 1 or 2 |mg anti-CD3 per 10 7 beads. Cells were treated with hGL50-mlgG2am on beads, 
using one bead per cell and 3 |mg hGL50-mlgG2am per 10 7 beads. CD28 signaling was provided (using anti-CD28 
(Pharmingen)) or stimulated to determine whether modulation of CD28 mediated costimulation had any effect on hGL50- 
mlgG2am mediated costimulation. 
45 [0357] IL-2 production, IL-10 production, and proliferation ( 3 H incorporation) were assayed as indicators of costimu- 
lation. Cytokines and proliferation were measured 72 hours after stimulation. 

[0358] As shown in Figure 22, hGL50-mlgG2am (also called hGLSO.Fc) can costimulate T cells, as shown by the 
increase in proliferation as well as the induction of IL-2 and IL-10 production. In the presence of antibodies to CD28, 
which induces CD28 mediated costimulation, IL-2 production is also induced. Figure 23 shows the effects of varying 
50 concentrations of anti-CD3 and anti-CD28 on proliferation and cytokine production. 

[0359] Figure 24 shows that adding anti-CD28 to T cells stimulated with anti-CD3 or anti-CD3 and soluble hGL50- 
mlgG2am (to stimulate CD28 mediated costimulation) induces IL-2 production, but does not influence hGL50 mediated 
IL-10 production. 

55 Example 13. Treatment of murine tumors using ICOS/GL50 pathway stimulation 

[0360] As of yet, the role of ICOS/GL50 costimulation in the generation of antitumor responses has not been reported. 
In this study, the relative efficacy of ICOS/GL50 costimulation was compared to CD28/B7 costimulation in various murine 
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tumor models. For systemic treatment of tumor bearing animals, murine B7.2-lgG2a and GL50-lgG2a fusion proteins 
were generated, which consist of the extracellular domain of B7.2 or GL50, respectively, and the Fc portion of murine 
lgG2a. Murine isotype lgG2a was used as a control. Mice bearing MethA or B16F1 melanoma tumors were treated 
subcutaneously with 50 i^g/injection of GL50-lgG2a or B7.2-lgG2a fusion protein twice weekly for three weeks. In the 
MethA model, treatment with B7.2-lgG2a resulted in up to 100% tumor regression (Figure 25A) and cure of the mice 
(Figure 25E), and treatment with GL50-lgG2a resulted in up to 60-90% cure of mice (Figure 25E) and in 40% significant 
tumor growth delay (Figure 25D). In the B16F1 melanoma, systemic treatment with either protein led to comparable 
significant tumor growth delay. In both tumor models, control lgG2a treatment had no effect (Figure 25A, C, and E). In 
tumor vaccines studies, the B16F1 melanoma and the MB49 bladder carcinoma models were used. Tumor cells were 
transduced with a vector containing the EF-1 alpha promoter expressing either murine B7.1 or GL50, and G418 (neo- 
mycin) selected tumor cells were injected subcutaneously for in vivo tumorigenicity experiments. Expression of GL50 
and B7-1 on tumor cells was determined by FACS analysis using an anti-mB7-1 monoclonal antibody (Pharmingen, 
clone 1 6-1 0A1 ) or ICOS-lgG2a fusion protein. The results demonstrate: (i) in the B1 6F1 model, 40% of the mice injected 
with GL50 expressing tumor cells and 20% of the mice injected with B7.1 expressing tumor cells reject their tumors 
(Figure 31 A); (ii) in the MB49 model, 30% of the mice injected with GL50 expressing tumor cells and 10% of the mice 
injected with B7.1 expressing tumor cells reject their tumors (Figure 31 B). These results indicate that enhanced in vivo 
ICOS/GL50 interactions, provided either by soluble GL50-lgG or GL50 expression on tumor cells, has significant antitumor 
activity that is comparable to the well described antitumor efficacy of the CD28/B7 pathway in murine tumor models. 

Equivalents 

[0361] Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many 
equivalents to the specific embodiments of the invention described herein. Such equivalents are intended to be encom- 
passed by the following claims. 
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SEQUENCE LISTING 

<110> Ling, V. and Dunussi, K, 

-cl20> NOVEL GL50 MOLECULES AND USES THEREFOR 
<130> GNN-007 



<140> 

10 <141> 



<160> 6 

\ 

<170> Patentln Ver. 2-0 



15 <210> 1 

<2ll> 271B 

<212> DNA 

<2l3> Mus musculus 

<220> 

20 <221> CDS 

<222:> (€7),. {1032) 

<400> 1 

ccggaacccc aaccgctgca actct<:cgcg tccgaaatcc agcatcccgc agtctgcgct 60 

25 cgcacc atg cag eta aag tgt ccc tgt ttt gtg tec ttg gga acc agg 108 

Met Gin Leu Lys Cys Pro Cys Phe Val Ser Leu Gly Thr Arg 
IS 10 

cag cct gtt tgg aag aag etc cat gtt tct age ggg ttc ttt tct ggt 156 
Gin Pro Val Trp Lys Lys Leu His Val Ser Ser Gly Phe Phe Ser Gly 
30 15 20 25 30 

ctt ggt ctg ttc ttg ctg ctg ttg age age etc tgt get gec tct gca 204 
Leu Gly Leu Phe Leu Leu Leu Leu Ser Ser Leu Cys Ala Ala Ser Ala 
35 40 45 

35 9^9" set gaa gtc ggt gca atg gtg ggc age aat gtg gtg etc age tgc 252 

Glu Thr Glu Val Gly Ala Met Val Gly Ser Asn Val Val Leu Ser Cys 
50 55 60 

att gac ccc cae aga cgc cat ttc aac ttg agt ggt ctg tat gtc tat 300 
lie Asp Pro His Arg Arg His Phe Asn Leu Ser Gly Leu Tyr Val Tyr 

40 SB .70 75 



tgg caa ate gaa aac cca gaa gtt teg gtg act tac tac ctg ect tac 348 
Trp Gin lie Glu Asn Pro Glu Val Ser Val Thr Tyr Tyr Leu Pro Tyr 
SO 85 90 ' 

aag tct cca ggg ate aat gtg gac agt tec tac aag aac agg ggc cat 356 
Lys Ser Pro Gly lie Asn Val Asp Ser Ser Tyr Lys Asn Arg Gly His 
95 100 105 110 

ctg tec ctg gac tec atg aag cag ggt aac ttc tct ctg tac ctg aag 444 
Leu Ser Leu Asp Ser Met Lys Gin Gly Asn Phe Ser Leu Tyr Leu Lys 
115 120 125 

aat gtc acc cct cag gat ace cag gag ttc aca tgc egg gta ttt atg 492 
Asn Val Thr Pro Gin Asp Thr Gin Glu Phe Thr Cys Arg Val Phe Met 
130 135 140 
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aat aca gcc aca gag tta gtc aag ate ttg gaa gag gtg gtc agg ctg 540 
Asn Thr Ala Thr Glu Leu Val Lys lie Leu Glu Glu Val Val Arg Leu 
145 150 155 

cgt gtg gca gca aac ttc agt aca cct gtc ate age acc tct gat age 5&6 
Arg Val Ala Ala Asn Phe Ser Thr Pro Val lie Ser Thr Ser Asp Ser 
160 165 170 

tec aac ccg ggc cag gaa cgt acc tac acc tgc atg tec aag aat ggc 636 
Ser Asn Pro Gly Gin Glu Arg- Thr Tyr Thr Cys Met Ser Lys Asn Gly 
175 ISO 185 190 

tac cca gag ccc aac ctg tat tgg ate aac aca acg gac aat age eta 684 
Tyr Pro Glu Pro Asn Leu Tyr Trp lie Asn Thr Thr Asp Asn Ser Leu 
195 200 205 

ata gac acg get ctg cag aat aac act gtc tac ttg aac aag ttg ggc 732 
lie Asp Thr Ala Leu Gin Asn Asn Thr Val Tyr Leu Asn Lys Leu Gly 
210 215 220 

ctg tat gat gta ate age aca tta agg etc cct tgg aca tct cgt ggg 780 
Leu Tyr Asp Val lie Ser Thr Leu Arg Leu Pro Trp Thr Ser Arg Gly 
225 230 235 

gat gtt ctg tgc tgc gta gag aat gtg get etc cac cag aac ate act 828 
Asp Val Leu Cys Cys Val Glu Asn Val Ala Leu His Gin Asn lie Thr 
240 245 250 

age att age cag gca gaa agt ttc act gga aa.t aac aca aag aac cca B7G 
Ser lie Ser Gin Ala Glu Ser Phe Thr Gly Asn Asn Thr Lys Asn Pro 
255 260 265 270 

cag gaa acc cac aat aat gag tta aaa gtc ctt gtc ccc gtc ctt get 924 
Gin Glu Thr His Asn Asn Glu Leu Lys Val Leu Val Pro Val Leu Ala 
275 280 2B5 

gta ctg gcg gca gcg gea ttc gtt tec ttc ate ata tac aga cgc acg 972 
Val Leu Ala Ala Ala Ala Phe Val Ser Phe lie He Tyr Arg Arg Thr 
290 295 300 

35 

cgt ccc cac cga age tat aca gga ccc aag act gta cag ctt gaa ctt 1020 
Arg Pro His Arg Ser Tyr Thr Gly Pro Lys Thr Val Gin Leu Glu Leu 
305 310 315 

aca gac cac gcc tgacaggact ctgcccagga tatggacagg gtttctgtga 1072 
40 Thr Asp His Ala 



25 



30 



50 



320 














gttgccacca 


ggtggatgte 


agacacaact 


tcagagtgga 


cccccacagg 


cctggtgaca 


1132 


gaggaeaacg 


agctgtctgc 


ttatgggctg 


tgatggaggc 


caggaatccc 


tggctttacg 


1192 


aggcacagag 


acttcatccc 


agaaaccccg 


agggagatct 


ctccagtggg 


cagcagcaac 


1252 


ateateggaa 


tatggagect 


ccggtgagct 


gtcggcacag 


agagcagcag 


cttgtgagaa 


1312 


gatccttcct 


tggcacgtta 


ctactcaggc 


ctaggagctt 


tataaaagag 


cgtttgagcc 


13 72 


actctgaaag 


ccctacagag 


tctactggag 


actttccctg 


caggaccttc 


agttggggag 


1432 


gaagectgae 


tttatttagg 


tctcaggcta 


cttgggcctc 


ttcgaggata 


tgtgggattt 


1492 
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tgtctactgc 


aaacctgttt 


ctggctgaca 


atggttgggc 


tcagaggcac 


tcagcttcac 


1552 


aacatcaatg ggacacgcct 


catccttgac 


ttcctgtggc 


tacagaagct 


ttccgaaagc 


1612 


cttgagctct 


ttcagactga 


acagctctgc 


ccagtctcag 


cagcccatga 


agate tcaac 


1672 


tccagcttcc 


tgggtctccg 


tgttgctggc 


cagaatagag 


ctagctcttt 


tgtttcaaga 


1732 


tggttctgca 


aagttggctg 


cttggaaacc 


tagggatgta 


tgtacaagct 


ccaggctgat 


1792 


gcagtagggg 


gcacggactc 


cccgatggaa 


cacagtatct 


gaccctaggt 


gagggcaagc 


1852 


tccttcccac 


gcagaggact 


ggaaattctg 


gaccgtcaag 


gcctgtctgc 


tatgtggctg 


1912 


gggctcagtg 


ctgatggatg 


tgtgagatct 


caggaatgag gagtgagaac 


cctgggctca 


1972 


ggactaggaa 


gacctgtcca 


tttttttttt 


tttttaatgc 


ccacatggac 


tttttattct 


2032 


tcacaccgat 


gtattcaatg 


agtgtagaga 


gaactactta 


agtccttccc 


gagtacaaag 


2092 


cattacctac 


ctgcagaata 


gcaacfcgttg 


ttatgggtct 


tgagttggca 


gctacagcaa 


2152 


acaagcacaa 


ggagcagttg 


gggtgcaaga 


agatggggtg 


cagcgccccc 


aaggacagac 


2212 


atttgggaat 


tagtggtctc 


cctgatgccc 


atagttcccc 


aggaactcag 


gtgggtctgc 


2272 


ggcagcacag 


taggagtatt 


cctcctactt 


taacttttct 


tgtcagacgt 


agfcttaggtt 


2332 


cagaaagagg 


tcaactcagc 


aagccagcta 


gccgccttgg 


ggcaccagac 


acactgcccc 


2392 


ccaccccctg 


cttatgtagg 


cattgggaac 


ccttcacaga 


ccactggctg 


tacagtcacc 


2452 


atcacctgct 


gattccagca 


gjcccccacc 


tfccttgtgga 


atcctgggag 


cactcccctc 


2512 


ttacccctca 


ctgcccccca 


ccccctgcac 


atcagcattc 


attagatttg 


ccctgtaacg 


2572 


tctgattcct 


cctttatctg 


ggttgtagat 


ggggcatagt 


gacttctaga 


aacctaacaa 


2632 


gggaataaat 


gtaagatgtg 


ctttcaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


2692 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaa 








2718 



<210> 2 
<211> 322 
<212> PRT 

<;213> Mus mu3culus 
<400> 2 

Met Gin Leu Lys Cys Pro Cys Phe Val Ser Leu Gly Thr Arg Gin Pro 
15 10 15 

Val Trp Lys Lys Leu His Val Ser Ser Gly Phe Phe Ser Gly Leu Gly 
20 25 30 

Leu Phe Leu Leu Leu Leu Ser Ser Leu Cya Ala Ala Ser Ala Glu Thr 
35 40 45 

Glu Val Gly Ala Met Val Gly Ser Asn Val Val Leu Ser Cys He Asp 
50 55 60 
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Pro His Arg Arg Hi© Phe Asn Leu Ser Gly Leu Tyr Val Tyr Trp Gin 
65 70 75 80 

lie Glu Asn Pro Glu Val Ser Val Thr Tyr Tyr Leu Pro Tyr Lys Ser 
85 90 95 

Pro Gly He Asn Val Asp Ser Ser Tyr Lys Asn Arg Gly His Leu Ser 
100 105 110 

Leu Asp Ser Met Lys Gin Gly Asn Phe Ser Leu Tyr Leu Lys Asn Val 
115 120 125 

Thr Pro Gin Asp Thr Gin Glu Phe Thr Cys Arg Val Phe Met Asn Thr 
130 135 140 

Ala Thr Glu Leu Val Lys lie Leu Glu Glu Val Val Arg Leu Arg Val 
145 150 155 ISO 

Ala Ala Asn Phe Ser Thr Pro Val lie Ser Thr Ser Asp Ser Ser Asn 
165 170 175 

Pro Gly Gin Glu Arg Thr Tyr Thr Cys Met Ser Lys Asn Gly Tyr Pro 
180 IBS 150 

Glu Pro Asn Leu Tyr Trp He Asn Thr Thr Asp Asn Ser Leu He Asp 
195 200 205 

Thr Ala Leu Gin Asn Asn Thr Val Tyr Leu Asn Lys Leu Gly Leu Tyr 
210 215 220 

Asp Val He Ser Thr Leu Arg Leu Pro Trp Thr Ser Arg Gly Asp Val 
225 230 23S 240 

Leu Cys Cys Val Glu Asn Val Ala Leu His Gin Asn He Thr Ser He 
245 250 255 

Ser Gin Ala Glu Ser Phe Thr Gly Asn Asn Thr Lys Asn Pro Gin Glu 
260 265 270 

Thr His Asn Asn Glu Leu hys Val Leu Val Pro Val Leu Ala Val Leu 
275 280 285 

Ala Ala Ala Ala Phe Val Ser Phe He He Tyr Arg Arg Thr Arg Pro 
250 295 300 

His Arg Ser Tyr Thr Gly Pro Lys Thr Val Gin Leu Glu Leu Thr Asp 
305 310 315 320 

His Ala 



<210> 3 

<211> 1759 

<212> DMA 

<213> Mus rausculus 

<220> 

<221> CDS 

<222> (1) . . (1041) 

<400> 3 
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10 



15 



25 



atg cag eta aag tgt ccc tgt ttt gtg tec ttg gga acc agg cag cct 48 
Met Gin Leu Lys Cys Pro Cys Phe Val Ser Leu Gly Thr Arg Gin Pro 
X 5 10 15 

gtt tgg aag aag etc cat gtt tct age ggg ttc ttt tct ggt ctt ggt 96 
Val Trp Lys Lys Leu His Val Ser Ser Gly Phe Phe Ser Gly Leu Gly 
20 25 30 

ctg ttc ttg ctg ctg ttg age age etc tgt get gee tct gca gag act 144 
Leu Phe Leu Leu Leu Leu Ser Ser Leu Cys Ala Ala Ser Ala Glu Thr 
35 -40 45 

gaa gtc ggt gca atg gtg ggc age aat gtg gtg etc age tgc att gac 192 
Glu Val Gly Ala Met Val Gly Ser Asn Val Val Leu Ser Cys He Asp 
50 55 60 

ccc cac aga cgc cat ttc aac ttg agt ggt ctg tat gtc tat tgg caa 240 
Pro His Arg Arg His Phe Asn Leu Ser Gly Leu Tyr Val Tyr Trp Gin 
65 70 75 SO 

ate gaa aac cca gaa gtt teg gtg act tac tac ctg cct tac aag tct 2&S 
lie Glu Asn Pro Glu Val Ser Val Thr Tyr Tyr Leu Pro Tyr Lys Ser 
85 90 95 

cca ggg ate aat gtg gac agt -tec tac aag aae agg ggc cat ctg tec 33 6 
Pro Gly lie Asn Val Asp Ser Ser Tyr Lys Asn Arg Gly His Leu Ser 
100 105 110 

ctg gac tec atg aag cag ggt aac ttc tct ctg tac ctg aag aat gtc 3B4 
Leu Asp Ser Met Lys Gin Gly Asn Phe Ser Leu Tyr Leu Lys Asn Val 
115 120 125 

acc cct cag gat acc cag gag ttc aca tgc egg gta ttt atg aat aca 432 
Thr Pro Gin Asp Thr Gin Glu Phe Thr Cys Arg Val Phe Met Asn Thr 
130 135 140 

gec aca gag tta gtc aag ate ttg gaa gag gtg gtc agg ctg cgt gtg 4 80 
Ala Thr Glu Leu Val Lys lie Leu Glu Glu Val Val Arg Leu Arg Val 
145 150 155 160 

gca gca aac ttc agt aca cct gtc ate age acc tct gat age tec aac 52 S 
Ala Ala Asn Phe Ser Thr Pro Val He Ser Thr Ser Asp Ser Ser Asn 
165 170 175 

cca ggc cag gaa cgt acc tac acc tgc atg tec aag aat ggc tac cca 576 
Pro Gly Gin Glu Arg Thr Tyr Thr Cys Met Ser Lys Asn Gly Tyr Pro 
ISO 185 190 

gag ccc aac ctg tat tgg ate aac aca acg gac aat age eta ata gac 624 

Glu Pro Asn Leu Tyr Trp He Asn Thr Thr Asp Asn Ser Leu He Asp 
195 200 205 

45 

acg get ctg cag aat aac act gtc tac ttg aac aag ttg ggc ctg tat 672 

Thr Ala Leu Gin Asn Asn Thr Val Tyr Leu Asn Lys Leu Gly Leu Tyr 

210 215 220 

gat gta ate age aca tta agg etc cct tgg aca tct cat ggg gat gtt 720 
50 Asp Val lie Ser Thr Leu Arg Leu Pro Trp Thr Ser His Gly Asp Val 

225 230 235 240 

ctg tgc tgc gta gag aat gtg get etc cac cag aac ate act age att 768 
Leu Cys Cys Val Glu Asn Val Ala Leu His Gin Asn He Thr Ser He 

55 



30 



40 
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245 250 255 

age cag gca gaa agt ttc act gga aat aac aca aag aac cca cag gaa 816 
Ser Gin Ala Glu Ser Phe Thr Gly Asn Asn Thr Lys Asn Pro Gin Glu 
260 265 270 

acc cac aat aat gag tta aaa gtc ctt gtc ccc gtc ctt get gta ctg 864 
Thr His Asn Asn Glu Leu Lys Val Leu Val Pro Val Leu Ala Val Leu 
275 280 285 

gcg gca gcg gca ttc gtt tec ttc ate ata tac aga cgc acg cgt ccc 912 
Ala Ala Ala Ala Phe Val Ser Fhe lie lie Tyr Arg Arg Thr Arg Pro 
290 235 300 

cac cga age tat aca gga ccc aag act gta cag ctt gaa ctt aca gac 960 
His Arg Ser Tyr Thr Gly Pro Lys Thr Val Gin Leu Glu Leu Thr Asp 
305 310 315 320 

act tgg get ccg gtc ccc tac cag gac tat ttg att cca aga tat ttg 1008 
Thr Trp Ala Pro Val Pro Tyr Gin Asp Tyr Leu He Pro Arg Tyr Leu 
325 330 335 

atg fcet cca tgc etc aaa aca cgt ggt tta cca taaaagecac tgtctcatct 1061 
Met Ser Pro Cys Leu Lys Thr Arg Gly Leu Pro 
340 345 

gttcagacca ctcaggctcc agccaggtgc cagaagtccc acttaccgag tctactgagc 1121 

acaagctatg taatgggtct gctctgctcc agcagcatag aacccccaag ccccaggtta 1181 

agacattttc aatgagcagg aacccaacca tactcacaga gctggagacc gagecagatg 1241 

cagaaaagaa ggcatgttcc agcccattac atagacatct gaggtgecac tggggagatc 13 01 

ccagagccca aattcaccgt gaatagtgtt tggtttcaga cccaggacaa gggactgagg 13 61 

tgcatatttt acaeatcaaa aeggacctgg cttccaggtt ctcccagcat ccctcagtcc 1421 

ctacctggca taccctgccc ccaaccctga actctccagc ccaggacctg ggctgccctt 1481 

cccccagagg ctcctcccta tataatccag acattttgtc tcctcctttc ctccctccca 1541 

ctctcttctt ttctctcgat gegatgetea tgegatgetc gatgetcatg ateaaatget 1601 

cccttctctc tttttctctc cctccccccc ttccacctct ttcctcacgg caactttcct 1661 

ggctttggtc ctagtgaact cactcacctg agagtgattc ccaataaacc cacctttata 1721 

taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1759 



<210> 4 
<211> 347 
<212> PRT 

<213> Mus musculus 
<400> 4 

Met Gin Leu Lys Cys Pro Cys Phe Val Ser Leu Gly Thr Arg Gin Pro 
1 5 10 15 

Val Trp Lys Lys Leu His Val Ser Ser Gly Phe Phe Ser Gly Leu <5ly 
20 25 30 
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Leu Phe Leu Leu Leu Leu Ser Ser Leu Cys Ala Ala Ser Ala Glu Thr 
35 40 45 

Glu Val Gly Ala Met Val Gly Ser Asn Val Val Leu Ser Cys lie Asp 
50 55 €0 

Pro His Arg Arg His Phe Asn Leu Ser Gly Leu Tyr Val Tyr Trp Gin 
65 70 75 SO 

lie Glu Asn Pro Glu Val Ser Val Thr Tyr Tyr Leu Pro Tyr Lys Ser 
85 90 95 

Pro Gly lie Asn Val Asp Ser Ser Tyr Lys Asn Arg Gly Hie Leu Ser 
100 105 110 

Leu Asp Ser Met Lys Gin Gly Asn Phe Ser Leu Tyr Leu Lys Asn Val 
115 120 125 

Thr Pro Gin Asp Thr Gin Glu Phe Thr Cys Arg Val Phe Met Asn Thr 
130 135 140 

Ala Thr Glu Leu Val Lys He Leu Glu Glu Val Val Arg Leu Arg Val 
145 150 155 160 

Ala Ala Asn Phe Ser Thr Pro Val He Ser Thr Ser Asp Ser Ser Asn 
165 ' 170 175 

Pro Gly Gin Glu Arg Thr Tyr Thr Cys Met Ser Lys Asn Gly Tyr Pro 
180 185 190 

Glu Pro Asn Leu Tyr Trp He Asn Thr Thr Asp Asn Ser Leu He Asp 
195 200 205 

Thr Ala Leu Gin Asn Asn Thr Val Tyr Leu Asn Lys Leu Gly Leu Tyr 
210 215 220 

Asp Val He Ser Thr Leu Arg Leu Pro Trp Thr Ser His Gly Asp Val 
225 230 235 240 

Leu Cys Cys Val Glu Asn Val Ala Leu His Gin Asn He Thr Ser He 
245 250 255 

Ser Gin Ala Glu Ser Phe Thr Gly Asn Asn Thr Lys Asn Pro Gin Glu 
260 265 270 

Thr His Asn Asn Glu Leu Lys Val Leu Val Pro Val Leu Ala Val Leu 
275 280 285 

Ala Ala Ala Ala Phe Val Ser Phe He He Tyr Arg Arg Thr Arg Pro 
290 295 300 

His Arg Ser Tyr Thr Gly Pro Lys Thr Val Gin Leu Glu Leu Thr Asp 
305 310 315 320 

Thr Trp Ala Pro Val Pro Tyr Gin Asp Tyr Leu He Pro Arg Tyr Leu 
325 330 335 

Met Ser Pro Cys Leu Lys Thr Arg Gly Leu Pro 
340 345 
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<210> 5 

<211> 953 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (24) . . (950) 

<400> 5 

ggcccgaggt ctccgcccgc acc atg egg ctg ggc agt cct gga ctg etc ttc 53 

Met Arg Leu Gly Ser Pro Gly Leu Leu Phe 
15 10 



ctg etc ttc age age ctt cga get gat act cag gag aag gaa gtc aga 101 
Leu Leu Phe Ser Ser Leu Arg Ala Asp Thr Gin Glu Lys Glu Val Arg 

15 is 20 as 

gcg atg gta ggc age gac gtg gag etc age tgc get tgc cct gaa gga 14 9 

Ala Met Val Gly Ser Asp Val Glu Leu Ser Cys Ala Cys Pro Glu Gly 
30 35 40 

20 

age cgt ttt gat tta aat gat gtt tac gta tat tgg caa acc agt gag 197 

Ser Arg Phe Asp Leu Asn Asp Val Tyr Val Tyr Trp Gin Thr Ser Glu 
45 ■ 50 55 

teg aaa acc gtg gtg acc tac cac ate cca cag aac age tec ttg gaa 24 5 
Ser Lys Thr Val Val Thr Tyr His He Pro Gin Asn Ser Ser Leu Glu 
60 65 70 

aac gtg gac age cgc tac egg aac cga gec ctg atg tea ccg gee ggc 293 

Asn Val Asp Ser Arg Tyr Arg Asn Arg Ala Leu Met Ser Pro Ala Gly 
75 80 85 90 

30 

atg ctg egg ggc gac ttc tee ctg cgc ttg ttc aac gtc acc ccc cag 341 

Met Leu Arg Gly Asp Phe Ser Leu Arg Leu Phe Asn Val Thr Pro Gin 
95 100 105 

gac gag cag aag ttt cac tgc ctg gtg ttg age caa tec etg gga ttc 389 
35 Asp Glu Gin Lys Phe His Cys Leu Val Leu Ser Gin Ser Leu Gly Phe 

110 115 120 

cag gag gtt ttg age gtt gag gtt aca ctg cat gtg gca gca aac ttc 437 

Gin Glu Val Leu Ser Val Glu Val Thr Leu His Val Ala Ala Asn Phe 

125 130 135 

40 

age gtg ccc gtc gtc age gee ccc cac age ccc tec cag gat gag etc 485 

Ser Val Pro Val Val Ser Ala Pro His Ser Pro Ser Gin Asp Glu Leu 

140 145 150 

acc ttc acg tgt aea tec ata aac ggc tac ccc agg cec aac gtg tac 533 
45 Thr Phe Thr Cys Thr Ser lie Asn Gly Tyr Pro Arg Pro Asn Val Tyr 

155 160 165 170 

tgg ate aat aag acg gac aac age ctg ctg gac cag get ctg cag aat 581 
Trp lie Asn Lys Thr Asp Asn Ser Leu Leu Asp Gin Ala Leu Gin Asn 
175 180 . 185 

50 

gac acc gtc ttc ttg aac atg egg ggc ttg tat gac gtg gtc age gtg 629 
Asp Thr Val Phe Leu Asn Met Arg Gly Leu Tyr Asp Val Val Ser Val 
190 195 200 
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ctg agg a he gca egg acc ccc age gtg aac att ggc tgc tgc ata gag 677 
Leu Arg He Ala Arg Thr Pro Ser Val Asn He Gly Cys Cys lie Glu 
205 210 215 

5 aac gtg ctt ctg cag cag aac ctg act gtc ggc age cag aca gga aat 725 

Asn Val Leu Leu Gin Gin Asn Leu Thr Val Gly Ser Gin Thr Gly Asn 
220 225 230 

gac ate gga gag aga gac aag ate aca gag aat cca gtc agt acc ggc 773 
Asp He Gly Glu Arg Asp Lys lie Thr Glu Asn Pro Val Ser Thr Gly 
10 235 240 245 250 

gag aaa aac gcg gec acg tgg age ate ctg get gtc ctg tgc ctg ctt 821 
Glu Lys Asn Ala Ala Thr Trp Ser He Leu Ala Val Leu Cys Leu Leu 
255 260 265 



15 



25 



30 



40 



50 



55 



gtg gtc gtg gcg gtg gec ata ggc tgg gtg tgc agg gac cga tgc etc 869 

Val Val Val Ala Val Ala He Gly Trp Val Cys Arg Asp Arg Cys Leu 
270 275 280 

caa cac age tat gca ggt gec tgg get gtg agt ccg gag aca gag etc 917 

Gin His Ser Tyr Ala Gly Ala Trp Ala Val Ser Pro Glu Thr Glu Leu 
285 230 295 

act gaa tec tgg aac ctg etc ctt ctg etc teg tga 953 

Thr Glu Ser Trp Asn Leu Leu Leu Leu Leu Ser 
300 305 



<2io> € 

<211> 309 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Arg Leu Gly Ser Pro Gly Leu Leu Phe Leu Leu Phe Ser Ser Leu 
1 5 10 15 

Arg Ala Asp Thr Gin Glu Lys Glu Val Arg Ala Met Val Gly Ser Asp 
20 25 30 

Val Glu Leu Ser Cys Ala Cys Pro Glu Gly Ser Arg Phe Asp Leu Asn 
35 40 45 

Asp Val Tyr Val Tyr Trp Gin Thr Scr Glu Ser Lys Thr Val Val Thr 
50 55 60 

Tyr His He Pro Gin Asn Ser Ser Leu Glu Asn Val Asp Ser Arg Tyr 
65 70 75 80 

Arg Asn Arg Ala Leu Met Ser Pro Ala Gly Met Leu Arg Gly Asp Phe 
85 90 95 

Ser Leu Arg Leu Phe Asn Val Thr Pro Gin Asp Glu Gin Lys Phe His 
100 105 110 

Cys Leu Val Leu Ser Gin Ser Leu Gly Phe Gin Glu Val Leu Ser Val 
115 120 125 

Glu Val Thr Leu His Val Ala Ala Asn Phe Ser Val Pro Val Val Ser 
130 135 140 
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Ala Pro His Ser Pro Ser Gin Asp Glu Leu Thir Phe Thr Cys Thr Ser 
■ 145 150 155 160 

He Asn Gly Tyr Pro Arg Pro Asn Val Tyr Trp lie Asn Lys Thr Asp 
165 170 175 

Asn Ser Leu Leu Asp Gin Ala Leu Gin Asn Asp Thr Val Phe Leu Asn 
130 185 190 

Met Arg Gly Leu Tyr Asp Val Val Ser Val Leu Arg lie Ala Arg Thr 
195 200 205 

Pro Ser Val Asn lie Gly Cys Cys lie Glu Asn Val Leu Leu Gin Gin 
210 215 220 

Asn Leu Thr Val Gly Ser Gin Thr Gly Asn Asp lie Gly Glu Arg Asp 
225 230 235 240 

Lys He Thr Glu Asn Pro Val Ser Thr Gly Glu Lys Asn Ala Ala Thr 
245 250 255 

Trp Ser He Leu Ala Val Leu Cys Leu Leu Val Val val Ala Val Ala 
260 265 270 

He Gly Trp Val Cys Arg Asp Arg- Cys Leu Gin His Ser Tyr Ala Gly 
275 280 285 

Ala Trp Ala Val Ser Pro Glu Thr Glu Leu Thr Glu Ser Trp Asn Leu 
290 295 300 

Leu Leu Leu Leu Ser 
305 



Claims 

1. An isolated nucleic acid molecule comprising the nucleotide sequence set forth in SEQ ID NO:1, 3, or 5. 

2. An isolated nucleic acid molecule encoding a polypeptide comprising the amino acid sequence set forth in SEQ ID 
NO:2, 4, or 6. 

3. An isolated nucleic acid molecule which encodes a naturally occurring allelic variant of a polypeptide comprising 
the amino acid sequence set forth in SEQ ID NO:2, 4, or 6. 

4. An isolated nucleic acid molecule comprising a nucleotide sequence which is at least 50% identical to the nucleotide 
sequence of SEQ ID NO:1 , 3, or 5 or a complement thereof selected from the group consisting of; 

a) a nucleic acid molecule comprising an isolated fragment of at least 500 nucleotides of a nucleic acid comprising 
the coding sequence of SEQ ID NO:1 , 3, or 5 or a complement thereof; 

b) a nucleic acid molecule which encodes a polypeptide comprising an amino acid sequence at least about 
60% homologous to the amino acid sequence of SEQ ID NO:2,4, or 6; and 

c) a nucleic acid molecule which encodes an isolated fragment of a polypeptide comprising the amino acid 
sequence of SEQ I D NO:2, 4, or 6, wherein the fragment comprises at least 1 5 contiguous amino acid residues 
of the amino acid sequence of SEQ ID NO:2, 4, or 6. 

5. An isolated nucleic acid molecule which hybridizes to the nucleic acid molecule of any one of claims 1 , 3, or 5 under 
stringent conditions. 

6. An isolated nucleic acid molecule comprising a nucleotide sequence which is complementary to the nucleotide 
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sequence of the nucleic acid molecule of any one of claims 1 , 3, or 5. 

7. An isolated nucleic acid molecule comprising the nucleic acid molecule of any one of claims 1, 3, or 5, and a 
nucleotide sequence encoding a heterologous polypeptide. 

8. A vector comprising the nucleic acid molecule of any one of claims 1 , 3, or 5. 

9. A vector comprising a nucleotide sequence encoding a portion of a GL50 molecule, wherein said portion encodes 
a GL50 cytoplasmic domain. 

10. The vector of claim 9, which is an expression vector. 

1 1 . A host cell transfected with the expression vector of claim 9. 

1 2. A method of producing a polypeptide comprising culturing the host cell of claim 11 in an appropriate culture medium 
to, thereby, produce the polypeptide. 

13. An isolated polypeptide selected from the group consisting of: 

a) an isolated fragment of a polypeptide comprising the amino acid sequence of SEQ ID NO:2, 4, or 6, wherein 
the fragment comprises at least 15 contiguous amino acids of SEQ ID NO:2, 4, or 6; 

b) a naturally occurring allelic variant of a polypeptide comprising the amino acid sequence of SEQ ID NO:2, 
4, or 6, wherein the polypeptide is encoded by a nucleic acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO:1 , 3, or 5 under stringent conditions; 

c) a polypeptide which is encoded by a nucleic acid molecule comprising a nucleotide sequence which is at 
least 50 % identical to a nucleic acid molecule comprising the coding region of the nucleotide sequence of SEQ 
ID NO:1, 3, or 5; 

d) a polypeptide comprising an amino acid sequence which is at least 50% identical to the amino acid sequence 
of SEQ ID NO:2, 4, or 6. 

14. The isolated polypeptide of claim 13 comprising the amino acid sequence of SEQ ID NO:2, 4, or 6. 

15. The polypeptide of claim 14, further comprising heterologous amino acid sequences. 

16. The polypeptide of claim 15, wherein the heterologous amino acid sequences are derived from an immunoglobulin 
molecule. 

17. A soluble polypeptide comprising an extracellular domain of a GL50 molecule. 

18. The soluble polypeptide of claim 17, which is an Ig fusion polypeptide. 

19. An antibody which selectively binds. to a polypeptide of claim 13. 

20. A method for modulating the immune response comprising administering a GL50 modulating agent to a subject 
such that the immune response of the subject is modulated. 

21. The method of claim 20, wherein the immune response is upmodulated. 

22. The method of claim 20, wherein the immune response is downmodulated. 

23. A method for modulating the immune response comprising administering an antibody which binds to a GL50 polypep- 
tide to a subject such that the immune response of the subject is modulated. 

24. The method of claim 23, further comprising administering at least one antibody which binds to a B7-1 or B7-2 molecule. 

25. A method for modulating T cell costimulation comprising contacting an activated T cell with a GL50 polypeptide 
such that T cell costimulation is modulated. 
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26. A method for detecting the presence of a polypeptide of claim 13 in a sample comprising: 

a) contacting the sample with a compound which selectively binds to the polypeptide; and 

b) determining whether the compound binds to the polypeptide in the sample to thereby detect the presence of 
a polypeptide of claim 13 in the sample. 

27. A method for reducing the proliferation of a tumor cell comprising contacting an immune cell with an activating form 
of a GL50 molecule such that an immune response to the tumor cell is enhanced and proliferation of the tumor cell 
is reduced. 

28. The method of claim 27, wherein the activating form of a GL50 molecule is a soluble polypeptide comprising the 
extracellular domain of GL50. 

29. The method of claim 27, wherein the activating form of a GL50 molecule is a cell associated polypeptide comprising 
the extracellular domain of GL50. 

30. A method for screening for a compound which modulates GL50 mediated activation of an immune cell comprising: 
i) contacting a polypeptide comprising at least one GL50 polypeptide domain with a test compound and a GL50 
binding partner and ii) identifying compounds that modulate the interaction of the polypeptide with the GL50 binding 
partner to thereby identify compounds that modulate GL50 mediated activation of an immune cell. 

31. The method of claim 30, wherein the polypeptide comprises a GL50 domain selected from the group consisting of: 
a transmembrane domain, a cytoplasmic domain, and an extracellular domain. 

32. The method of claim 30, wherein the domain is a splice variant of a GL50 cytoplasmic domain. 

33. The method of claim 30, wherein the GL50 polypeptide domain comprises at least one amino acid substitution. 

34. A method for screening for a compound which modulates signal transduction in an immune cell comprising contacting 
an immune cell that expresses a GL50 molecule with a test compound and determining the ability of the test 
compound to modulate signal transduction via GL50 to thereby identify a compound with modulates a signal in an 
immune cell. 
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FIG. 4A 
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FIG. 14B 
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